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@ Procsryotlc cBrbcnyi hydrolases, methods, DMA, vectors and transformed hosts for producing them, 
composition* containing them. 

@ Methods and vectors are provided for the production of 
procaryotic carbony) bydrolasesfn recombinant systems. DMA 
which encodes such hydrolases are mutated at predetermined 
regions by known methods or by a novel point mutagenesis 
method in order to generate mutant hydrolases. Particular 
point mutations incarbonyi hydrolases such assubtilisin result 
in modifications of oxidation stability, Km, Kcat, Kcat/Km ratio 
substrate specificity, specific activity or pH-activity profiles. 
These* mutated hydrolases are particularly useful in laundry 
compositions. Mut*tioos in the genes encoding the subtiiisin or 
neutral protease of bacillus yield substantially normally sporu- 
iating bacillus swains which are incapable of excreting subtii- 
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100/157 , 241 , 242 , 243 , 247 ,248 



PROCARYOTiCXARBONYL-, HYDROLASES, > METHODS, . DMA, 
VECTORS AND TRANSFORMED HOSTS FOR PRODUCING 
THEM, -AMD DETERGENT COMPOSITIONS CONTAINING THEM 



Background 

This Invention relates to the production and manipulation of 
proteins using recombinant techniques in suitable hosts. More 
specifically, the invention relates to the production of procaryotic 
proteases such as subtilisin and neutral protease using recombinant 
microbial host cells, to the synthesis of heterologous proteins by 
microbial hosts, and to the directed mutagenesis of enzymes in order 
to modify the characteristics thereof. 

Various bacteria are known to secrete proteases at some stage in 
their life-cycles. Bacillus species produce two major extracellular 
proteases, a neutral protease {a metal loprotease inhibited by EDTA) 

and an alkaline protease {or subtilisin, a serine endoprotease}. 

i 

Both generally are produced in greatest quantity after the 
exponential growth phase, when the culture enters stationary phase 
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and begins the process of spoliation. The physiological role of 
these two proteases is not clear. They have been postulated to play 
a role in spoliation (J. Hoch, 1976, "Adv. Genet." 18:69-98; 
P. Piggot et aT., 1976, "Bact. Rev." 40:908-962; and F. Priest, 

5 1977, "Bact. Rev." 41:711-753}, to be Involved in the regulation of 
cell wall turnover (L. Jolliffe et al ♦ , 1980, "J, Bact." 
141:1199-1208), and to be scavenger enzymes (Priest, Id.). The 
regulation of expression of the protease genes is complex. They 
appear to be coordinate!/ regulated in concert with sporulation, 

10 since mutants blocked in the early stages of sporulation exhibit 
reduced 'levels of both the alkaline and neutral protease. 
Additionally, a number of pleiotropic mutations exist which affect 
the level of expression of proteases and other secreted gene 
products, such as amylase and levansucrase {Priest, Id.). 

15 

Subtil i sin has found considerable utility in industrial and 
commercial applications (see U.S. Patent No. 3,623,957 and 
J, Millet, 1970, M. Appl. Bact." 33:207). For example, subtil Isins 
and other proteases are commonly used in detergents to enable 
20 removal of protein-based stains. They also are used in food 

processing to accommodate the proteinaceous substances present in 
the food preparations to their desired impact on the composition. 

Classical mutagenesis of bacteria with agents such as radiation 
25 or chemicals has produced a plethora of mutant strains exhibiting 
different properties with respect to the growth phase at which 
protease excretion occurs as well as the timing and activity levels 
of excreted protease. These strains, however, do not approach the 
ultimate potential of the organisms because the mutagenic process is 
30 essentially random, with- tedious selection and screening required to 
identify organisms which even approach the desired characteristics. 
Further, these mutants are capable of reversion to the parent or 
wild- type strain. In such event the desirable property is lost. 
The probability of reversion is unknown when dealing with random 
35 mutagenesis since the type and site of mutation is unknown or poorly 
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characterized. This introduces considerable uncertainty Into the 
industrial process which is based on the enzyme-synthesizing 
bacterium. Finally, classical mutagenesis frequently couples a 
desirable phenotype, e.g., low protease levels, with an undesirable 
5 character such as excessive premature cell lysis. 

Special problems exist with respect to the proteases which are 
excreted by Bacillus, For one thing, since at least two such 
proteases exist, screening for the loss of only one is difficult. 
10 Additionally, the large number of pleiotropic mutations affecting 
both spoliation and protease production make the isolation of true 
protease mutations difficult. 

Temperature sensitive mutants of the neutral protease gene have 
15 been obtained by conventional mutagenic techniques, and were used to 
map the position of the regulatory and structural gene in the 
Bacillus subtil is chromosome (H. Uehara et al_., 1979, "J. Bact." 
139:583-590). Additionally, a presumed nonsense mutation of the 
alkaline protease gene has been reported (C. Roitsch et al ., 1983, 
20 "u\ 8act. M 155:145-152). 

Bacillus temperature sensitive mutants have been isolated that 
produce inactive serine protease or greatly reduced levels of serine 
protease. These mutants, however, are asporogenous and show a 

25 reversion frequency to the wild-type of about from 10"' to 10 

(F. Priest, Id. p. 719). These mutants are unsatisfactory for the 
recombinant production of heterologous proteins because asporogenous 
mutants tend to lyse during earlier stages of their growth cycle in 
minimal medium than do sporogenic mutants, thereby prematurely 

30 releasing cellular contents (including intracellular proteases) into 
the culture supernatant. The possibility of reversion also is 
undesirable since wild- type revertants will contaminate the culture 
supernatant with excreted proteases. 



35 
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Bacillus sp. have been proposed for the expression of 
heterologous proteins, but the presence of excreted proteases and 
the potential resulting hydrolysis of the desired product has 
retarded the commercial acceptance of Bacillus as a host for the 
5 expression of heterologous proteins. Baci 1 1 us megatari urn mutants 
have been disclosed that are capable of sporulation and which do not 
express a sporulation-associated protease during growth phases. 
However, the assay employed did not exclude the presence of other 
proteases, and the protease in question is expressed during the 

10 spoliation phase (C. Loshon et aK> 1982, "0- Bact." 150:303-311). 
This, of course, is the point at which heterologous protein would 
have accumulated in the culture and be vulnerable. It is an 
objective herein to construct a Bacillus strain that is 
substantially free of extracellular neutral and alkaline protease 

15 during all phases of its growth cycle and which exhibits 

substantially normal sporulation characteristics. A need exists for 
non-re vertible, otherwise normal protease deficient organisms that 
can then be transformed with high copy number plasmids for the 
expression of heterologous or homologous proteins. 

20 

Enzymes having characteristics which vary from available stock 
are required. In particular, enzymes having enhanced oxidation 
stability will be useful in extending the shelf life and bleach 
compatibility of proteases used in laundry products. Similarly, 
25 reduced oxidation stability would be useful in industrial processes 
that require the rapid and efficient- quenching of enzymatic activity. 

Modifying the pH-activity profiles of an enzyme would be useful 
in making the enzymes more efficient In a wide variety of processes, 
30 e.g. broadening the pH-activity profile of a protease would produce 
an enzyme more suitable for both alkaline and neutral laundry 
products. Narrowing the profile, particularly when combined with 
tailored substrate specificity, would make enzymes in a mixture more 

compatible, as will be further described herein. 

35 

0992Y 



-5- 



0130756 



Mutations of procaryotic carbonyl hydrolases {principally 
proteases but including lipases) will facilitate preparation of a 
variety of different hydrolases, particularly those having other 
modified properties such as Km, Kcat, Km/Kcat ratio and substrate 
5 specificity. These enzymes can then be tailored for the particular 
substrate which is anticipated to be present, for example in the 
preparation of peptides or for hydrolytic processes such as laundry 
uses. 

10 Chemical modification of enzymes is known. For example, see I. 
Svendseir, 1976, "Carls berg Res. Commun." 41_ {5): 237-291. These 
methods, however, suffer from the disadvantages of being dependent 
upon the presence of convenient amino acid residues, are frequently 
nonspecific in that they modify all accessible residues with common 

15 side chains, and are not capable of reaching inaccessible amino acid 
residues without further processing, e.g. denaturati on, that is 
generally not completely reversible in reinstituting activity. To 
the extent that such methods have the objective of replacing one 
amino acid residue side chain for another side chain or equivalent 

20 functionality, then mutagenesis promises to supplant such methods. 

Predetermined, site-directed mutagenesis of tRNA synthetase in 
which a cys residue is converted to serine has been reported 
{G. Winter eta]_., 1982, "Mature" 299:756-758; A. Wilkinson et al_., 
25 1984, "Mature" 307:187-188}. This method is not practical for large 
scale mutagenesis. It is an object herein to provide a convenient 
and rapid method for mutating DMA by saturation mutagenesis. 



30 Summary 



A method for producing procaryotic carbonyl hydrolase such as 
subtil isin and neutral protease in recombinant host cells is 
described in which expression vectors containing sequences which 
35 encode desired subtil isin or neutral protease, including the pro, 
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pre, or prepro forms of these enzymes, are used to transform hosts, 
the host cultured and desired enzymes recovered. The coding 
sequence may correspond exactly to one found In nature, or may 
contain modifications which confer desirable properties on the 
protein that is produced, as is further described below. 

The novel strains then are transformed with at least one dUA 
moiety encoding a polypeptide not otherwise expressed in the host 
strain, the transformed strains cultured and the polypeptide 
recovered from the culture. Ordinarily, the DNA moiety is a 
directed. mutant of a host Bacillus gene, although it may be DMA 
encoding a eucaryotic (yeast or mammalian) protein- The novel 
strains also serve as hosts for protein expressed from a bacterial 
gene derived from sources other than the host genome, or for vectors 
expressing these heterologous genes, or homologous genes from the 
host genome. In the latter event enzymes such as amylase are 
obtained free of neutral protease or subtil isin. In addition, it is 
now possible to obtain neutral protease in culture which is free of 
enzymatically active subtilisin, and vice-versa. 

One may* by splicing the cloned genes for procaryotic carbonyl 
hydrolase into a high copy number plasmid, synthesize the enzymes in 
enhanced yield compared to the parental organisms. Also disclosed 
are modified forms of such hydrolases, including the pro and prepro 
zymogen forms of the enzymes, the pre forms, and directed mutations 
thereof. 

A convenient method is provided for saturation mutagenesis, 
thereby enabling the rapid and efficient generation of a plurality 
of mutations at any one site within the coding region of a protein, 
comprising; 

{a} obtaining a DMA moiety encoding at least a portion of 
said precursor proteins 
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{b} identifying a region within the moiety; 

(c) substituting nucleotides for those already existing 
within the region in order to create at least one 
5 restriction enzyme site unique to the moiety, whereby unique 

restriction sites 5' and 3' to the identified region are 
made available such that neither alters the amino acids 
coded for by the region as expressed; 

10 (d) synthesizing a plurality of oligonucleotides, the 5 1 and 

<5 l ends of which each contain sequences capable of annealing 
to the restriction enzyme sites introduced in step (c) and 
which, when li gated to the moiety, are expressed as 
• substitutions, deletions and/or insertions of at least one 
15 amino acid in or into said precursor protein; 

(e) digesting the moiety of step {c) with restriction 
enzymes capable of cleaving the unique sites; and 

20 £f) ligating each of the oligonucleotides of step (d) into 

the digested moiety of step {e) whereby a plurality of 
mutant DNA moieties are obtained. 



By the foregoing method or others known in the art, a mutation 
25 is introduced into isolated DNA encoding a procaryotic carbonyl 
hydrolase which, upon expression of the DNA, results in the 
substitution, deletion or insertion of at least one amino acid at a 
predetermined site in the hydrolase. This method is useful in 
creating mutants of wild type proteins {where the "precursor" 
30 protein is the wild type) or reverting mutants to the wild type 
(where the "precursor" i*s the mutant. 

Mutant enzymes are recovered which exhibit oxidative stability 
and/or pH-activity profiles which differ from the precursor 
35 enzymes. Procaryotic carbonyl hydrolases having varied Km, Kcat, 
Kcat/Km ratio and substrate specificity also are provided herein. 
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The mutant enzymes obtained by the methods herein are combined 
in known fashion with surfactants or detergents to produce novel 
compositions useful In the laundry or other cleaning arts. 

Brief Description of the Drawing 

Figure 1 shows the sequence of a functional B_. aroyl ol iquefaciens 
subtil i sin gene. 

In Figure 1A, the entire functional sequence for B_. 
amylol iqu efaciens , including the promoter and rihosome binding site, 
are present on a 1.5 kb fragment of the B. a mylo l Iquefaciens genome. 

Figure IB shows the nucleotide sequence of the coding strand, 
correlated with the amino acid sequence of the protein. Promoter 
(p) ri be-some binding site (rbs) and termination (term) regions of 
the DMA sequence are also shown. 

Figure 2 shows the results of replica nitrocellulose filters of 
purified positive clones probed with Pool 1 (Panel A) and Pool 2 
{Panel B) respectively. 

Figure 3 shows the restriction analysis of the subtil isin 
expression plasmid (pS4). pBS42 vector sequences (4.5 kb) are shown 
in solid while the insert sequence (4.4 kb) is shown dashed. 

Figure 4 shows, the results of SOS-PAGE performed on supernatants 
from cultures transformed with p8S42 and pS4. 

Figure 5 shows the construction of the shuttle vector p8S42. 

Figure 6 shows a restriction map for a sequence Including the B. 
subtil fs subtil isin gene. 
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Figure 7 is the sequence of a functional B. subtil is subtilisin 
gene. 

Figure 8 demonstrates a construction method for obtaining a 
deletion mutant of a B. sub t il is subtilisin gene. 

Figure 9 discloses the restriction map for a B_. subttlis neutral 
protease gene. 

Figure 10 is the nucleotide sequence for a B. subti 1 i s neutral 
protease gene. 

Figure 11 demonstrates the construction of a vector containing a 
H* subtil is neutral protease gene. 

Figures 12, 13 and 16 disclose embodiments of the mutagenesis 
technique provided herein. 

Figure 14 shows the enhanced oxidation stability of a subtilisin 
mutant. 

Figure 15 demonstrates a change in the pH-activity profile of a 
subtilisin mutant when compared to the wild type enzyme. 

Detail eri Desert pti on 

Procaryotic carbonyl hydrolases are enzymes which hydrolyze 
0 

compounds containing C-X bonds in which X is oxygen or nitrogen. 
They principally include hydrolases, e.g. lipases and peptide 
hydrolases, e.g. subti li sins or metal lop ro teases. Peptide 
hydrolases include a-aminoacyl peptide hydrolase, peptidyl ami no-acid 
hydrolase, acylamino hydrolase, serine carboxypeptidase, 
me ta 1 1 ocarboxypepti dase, thiol proteinase, carboxyl proteinase and 
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meta lloproteinase. Serine, metal! o, thiol and acid proteases are 
included, as well as endo and exo-pro teases. 

Subtil isins are serine proteinases which generally act to cleave 
5 internal peptide bonds of proteins or peptides. Metal lopro teases 
are exo~ or endoproteases which require a metal ion cofactor for 
activity. 

A number of naturally occurring mutants of subtilisin or neutral 
1Q protease exist, and all may be employed with equal effect herein as 
sources ior starting genetic material. 

These enzymes and their genes may be obtained from many 
procaryotic organisms. Suitable examples include gram negative 
1g organisms such as E. coli or pseudomonas and gram positive bacteria 
such as micrococcus or bacillus. 

The genes encoding the carbonyl hydrolase may be obtained in 
accord with the general method herein. As will be seen from the 

2Q examples, this comprises synthesizing labelled probes having 

putative sequences encoding regions of the hydrolase of interest, 
preparing genomic libraries from organising expressing the 
hydrolase, and screening the libraries for the gene of interest by 
hybridization to the probes. Positively hybridizing clones are then 

25 mapped and sequenced. The cloned genes are li gated into an 
expression vector {which also may be, the cloning vector} with 
requisite regions for replication in the host, the plasmid 
transfected into a host for enzyme synthesis and the recombinant 
host cells cultured under conditions favoring enzyme synthesis, 

30 usually selection pressure such as is supplied by the presence of an 
antibiotic, the resistance to which is encoded by the vector. 
Culture under these conditions results in enzyme yields multifold 
greater than the wild type enzyme synthesis of the parent organism, 
even if it is the parent organism that is transformed. 

35 
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"Expression vector" refers to a DNA construct containing a DNA 
sequence which is operably linked to a suitable control sequence 
capable of effecting the expression of said DMA in a suitable host. 
Such control sequences include a promoter to effect transcription, 
5 an optional operator sequence to control such transcription, a 
sequence encoding suitable tnRNA ri bosome binding sites, and 
sequences which control termination of transcription and 
translation. The vector may be a plasraid, a phage particle, or 
simply a potential genomic insert. Once transformed into a suitable 

10 host, the vector may replicate and function independently of the 
host genome, or may, in some instances, integrate into the genome 
itself. In the present specification, "plasmid" and "vector" are 
sometimes used interchangeably as the plasmid is the most commonly 
used form of vector at present. However, the invention 

15 is intended to include such other forms of expression vectors which 
serve equivalent functions and which are, or become, known in the 
art. 

"Recombinant host cells" refers to cells which have been 
20 transformed or transfected with vectors constructed using recombinant 
QUA techniques. As relevant to the present invention, recombinant 
host cells are those which produce procaryotic carbonyl hydrolases 
in its various forms by virtue of having been transformed with 
expression vectors encoding these proteins. The recombinant host 
25 cells may or may not have produced a form of carbonyl hydrolase 
prior to transformation. 

"Operably linked" when describing the relationship between two 
DMA regions simply means that they are functionally related to each 

30 other. For example, a p resequence is operably linked to a peptide 
if it functions as a signal sequence, participating in the secretion 
of the mature form of the protein most probably involving cleavage 
of the signal sequence. A promoter 1s operably linked to a coding 
sequence if it controls the transcription of the sequence; a 

35 nbosome binding site is operably linked to a coding sequence if it 
is positioned so as to permit translation. 
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"Prohydrolase" refers to a hydrolase which contains additional 
N- terminal amino acid residues which render the enzyme inactive but, 
when removed, yield an enzyme. Many proteolytic enzymes are found 
in nature as translations! proenzyme products and, in the absence of 
post- trans! ational products, are expressed in this fashion. 

"Presequence" refers to a signal sequence of amino acids bound 
to the N-terminal portion of the hydrolase which may participate in 
the secretion of the hydrolase, ^resequences also may be modified 
in the same fashion as is described here, including the introduction 
of predetermined mutations* When bound to a hydrolase, the subject 
protein becomes a "prehydrolase"- Accordingly, relevant 
prebydrolase for the purposes herein are presubtil isin and 
preprosubtilisin. Prehydrolases are produced by deleting the "pro" 
sequence {or at least that portion of the pro sequence that 
maintains the enzyme in its inactive state) from a prepro coding 
region, and then expressing the prehydrolase. In this way the 
organism excretes the active rather than proenzyme. 

The cloned carbonyl hydrolase is used to transform a host cell 
in order to express the hydrolase. This will be of interest where 
the hydrolase has commercial use in its unmodified form, as for 
example subtil isin in laundry products as noted above. In the 
preferred embodiment the hydrolase gene is ligated into a high copy 
number plasmid. This plasmid replicates in hosts in the sense that 
it contains the well-known elements necessary for plasmid 
replication; a promoter operably linked to the gene in question 
(which may be supplied as the gene's own homologous promo tor if it 
is recognized, i.e., transcribed, by the host), a transcription 
termination and polyadenylation region (necessary for stability of 
the mRNA transcribed by the host from the hydrolase gene) which is 
exogenous or is supplied by the endogenous terminator region of the 
hydrolase gene and, desirably 3 a selection gene such as an 
antibiotic resistance gene that enables continuous cultural 
maintenance of plasraid-infected host cells by growth in 
0992Y 
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antibiotic-containing media* High copy number plasmids also contain 
an origin of replication for the host, thereby enabling large 
numbers of plasnrids to be generated in the cytoplasm without 
chromosonal limitations. However, it is within the scope herein to 
5 integrate multiple copies of the hydrolase gene into host genome. 
This is facilitated by bacterial strains which are particularly 
susceptible to homologous recombination. The resulting host cells 
are termed recombinant host cells. 

Once the carbonyl hydrolase gene has been cloned, a number of 
modifications are undertaken to enhance the use of the gene beyond 
synthesis of the wild type or precursor enzyme. A precursor enzyme 
is the enzyme prior to its modification as described in this 
application. Usually the precursor is the enzyme as expressed by 
the organism which donated the DMA modified in accord herewith. The 
term "precursor" is to be understood as not implying that the 
product enzyme was the result of manipulation of the precursor 
enzyme j>er se. 

20 In the first of these modifications, the gene may be deleted 
from a recombination positive (rec + ) organism containing a 
homologous gene. This is accomplished by recombination of an in 
vitro deletion mutation of the cloned gene with the genome of the 
organism. Many strains of organisms such as E.coli and Bacillus are 

25 known to be capable of recombination. All that is needed is for 
regions of the residual DHA from the deletion mutant to recombine 
with homologous regions of the candidate host. The deletion may be 
within the coding region {leaving enzymatically inactive 
polypeptides) or include the entire coding region as long as 

30 homologous flanking regions {such as promoters or termination 
regions) exist in the host. Acceptability of the host for 
recombination deletion mutants is simply determined by screening for 
the deletion of the transformed phenotype. This is most readily 
accomplished in the case of carbonyl hydrolase by assaying host 

35 cultures for loss of the ability to cleave a chromogenic substrate 
otherwise hydroly zed by the hydrolase. 
0992Y 
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Transformed hosts contained the protease deletion mutants are 
useful for synthesis of products which are incompatible with 
proteolytic enzymes. These hosts by definition are incapable of 
excreting the deleted proteases described herein, yet are 
5 substantially normally sporulating* Also the other growth 
characteristics of the transformants are substantially like the 
parental organism. Such organisms are useful in that it is expected 
they will exhibit comparatively less inactivation of heterologous 
proteins than the parents, and these hosts do have growth 

10 characteristics superior to known protease-deficient organisms. 
However/ the deletion of neutral protease and subtil isin as 
described in this application does not remove all of the proteolytic 
activity of Bacillus. It is believed that intracellular proteases 
which are not ordinarily excreted extracellularly "leak" or diffuse 

15 from the cells during late phases of the culture. These 

intracellular proteases may or stay not be subtil isin or neutral 
protease as those enzymes are defined herein. Accordingly, the 
novel Bacillus strains herein are incapable of excreting the 
subtil isin and/or neutral protease enzymes which ordinarily are 

20 excreted extracellularly in the parent strains. "Incapable" means 
not revertible to the wild type. Reversion is a finite probability 
that exists with the heretofore known protease-deficient, naturally 
occurring strains since there is no assurance that the phenotype of 
such strains is not a function of a readily revertible mutation, 

25 e.g. a point mutation. This to be contrasted with the extremely 
large deletions provided herein. 

The deletion mutant-transformed host cells herein are free of 
genes encoding enzymatically active neutral protease or subtil isin, 
30 which genes are defined as those being substantially homologous with 
the genes set forth in Figs. 1, 7 or 10. "Homologous" genes contain 
coding regions capable of hybridizing under high stringency 
conditions with the genes shown in Hgs» l» 7 or 10. 



35 
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The microbial strains containing carbonyl hydrolase deletion 
mutants are useful in two principal processes. In one embodiment 
they are advantageous in the fermentative. production of products 
ordinarily expressed by a host that are desirably uncontaminated 
s with the protein encoded by the deletion gene. An example is 
f erraentati ve synthesis of amylase, where contaminant proteases 
interfere in many industrial uses for amylase. The novel strains 
herein relieve the art from part of the burden of purifying such 
products free of contaminating carbonyl hydrolases. 

10 

In a* second principal embodiment, subtilisin and neutral 
protease deletion-mutant strains are useful in the synthesis of 
protein which is not otherwise encoded by the strain. These 
proteins will fall within one of two classes. The first class 

15 consists of proteins encoded by genes exhibiting no substantial 
p retrans forma ti on homology with those of the host. These may be 
proteins from other procaryotes but ordinarily are eucaryotic 
proteins from yeast or higher eucaryotic organisms, particularly 
mammals. The novel strains herein serve as useful hosts for 

20 expressible vectors containing genes encoding such proteins because 
the probability for proteolytic degradation of the expressed, 
non-homologous proteins is reduced. 

The second group consists of mutant host genes exhibiting 
25 substantial pretransformation homology with those of the host. 
These include mutations of procaryotic carbonyl hydrolases such as 
subtilisin and neutral protease, as well as microbial {rennin, for 
example rennin from the genus Mucor) . These mutants are selected in 
order to improve the characteristics of the precursor en?yrae for 
30 industrial uses. 

A novel method is provided to facilitate the construction and 
identification of such mutants. First, the gene encoding the 
hydrolase is obtained and sequenced in whole or in part. Then the 
35 sequence is scanned for a point at which it is desired to make a 
0992V 
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mutation {deletion* insertion or substitution) of one or more amino 
acids in the expressed enzyme. The sequences flanking this point 
are evaluated for the presence of restriction sites for replacing a 
short segment of the gene with an oligonucleotide pool which when 
5 expressed will encode various mutants. Since unique restriction 
sites are generally not present at locations within a convenient 
distance from the selected point (from 10 to 15 nucleotides) , such 
sites are generated by substituting nucleotides in the gene in such 
a fashion that neither the reading frame nor the amino acids encoded. 

to are changed in the final construction. The task of locating 
suitable flanking regions and evaluating the needed changes to 
arrive at two unique restriction site sequences is made routine by 
the redundancy of the genetic code, a restriction enzyme map of the 
gene and the large number of different restriction enzymes. Note 

15 that if a fortuitous flanking unique restriction site is available, 
the above method need be used only in connection with the flanking 
region which does not contain a site. 

Mutation of the gene in order to change its sequence to conform 
20 to the desired sequence is accomplished by Ml 3 primer extension in 
accord with generally known methods. Once the gene is cloned, it is 
digested with the unique restriction enzymes and a plurality of end 
termini -complementary oligonucleotide cassettes are 1 i gated into the 
unique sites. The mutagenesis is enormously simplified by this 
25 method because all of the oligonucleotides can be synthesized so as 
to have the same restriction sites,. and no synthetic linkers are 
necessary to create the restriction sites. 

The number of commercially available restriction enzymes having 
30 sites not present in the gene of interest is generally large. A 

suitable DNA sequence computer search program simplifies the task of 
finding potential 5* and 3' unique flanking sites. A primary 
constraint is that any mutation introduced in creation of the 
restriction site must be silent to the fina l constructed amino acid 
35 coding sequence. For a candidate restriction site 5" to the target 
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codon a sequence must exist in the gene which contains at least all 
the nucleotides but for one in the recognition sequence 5 1 to the 
cut of the candidate enzyme- For example, the blunt cutting enzyme 
Smal (CCC/GGG) would be a 5' candidate if a nearby 5' sequence 
contained NCC, CMC, or CCN. Furthermore, if H needed to be altered 
to C this alteration must leave the amino acid coding sequence 
intact. In cases where a permanent silent mutation is necessary to 
introduce a restriction site one may want to avoid the introduction 
of a rarely used codon. A similar situation for Smal would apply 
for 3' flanking sites except the sequence NGG, GMG, or GGM must 
exist, .The criteria for locating candidate enzymes is most relaxed 
for blunt cutting enzymes and most stringent for 4 base overhang 
enzymes. In general many candidate sites are available. For the 
codon-222 target described herein a Ball site (TGG/CCA) could have 
been engineered in one base pair 5* from the Kpnl site, A3' EcoRV 
site { GAT/A TC) could have been employed 11 base pairs 5' to the PstI 
site. A cassette having termini ranging from a blunt end up to a 
four base-overhang will function without difficulty. In retrospect, 
this hypothetical EcoRV site would have significantly shortened the 
oligonucleotide cassette employed (9 and 13 base pairs) thus 
allowing greater purity and lower pool bias problems. Flanking 
sites should obviously be chosen which cannot themselves li gate so 
that ligation of the oligonucleotide cassette can be assured in a 
single orientation. 

The mutation per se need not be predetermined. For example, an 
oligonucleotide cassette or fragment is randomly mutagenized with 
nitrosoguanidine or other mutagen and then in turn li gated into the 
hydrolase gene at a predetermined location. 

The mutant carbonyl hydrolases expressed upon transformation of 
the suitable hosts are screened for enzymes exhibiting desired 
characteristics, e.g. substrate specificity, oxidation stability, 
pH-activity profiles and the like. 
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A change in substrate specificity is defined as a difference 
between the Kcat/Km ratio of the precursor enzyme and that of the 
mutant. The Kcat/Km ratio is a measure of catalytic efficiency. 
Procaryotic carbonyl hydrolases with increased or diminished Kcat/Km 
5 ratios are described in the examples. Generally, the objective will 
be to secure a mutant having a greater {numerically larger) Kcat/Km 
ratio for a given substrate, thereby enabling the use of the enzyme 
to more efficiently act on a target substrate. An increase in 
Kcat/Km ratio for one substrate may be is accompanied by a reduction 
10 in Kcat/Km ratio for another substrate. This is a shift in 
substrate specificity, and mutants exhibiting such shifts have 
utility where the precursors are undesirable, e.g. to prevent 
undesi red hydrolysis of a particular substrate in an admixture of 
substrates. 

15 

Keat and Km are measured in accord with known procedures, or as 
described 1n Example 18. 

Oxidation stability is a further objective which is accomplished 
20 by mutants described in the examples. The stability may be enhanced 
or diminished as is desired for various uses. Enhanced stability is 
effected by deleting one or more methionine, tryptophan, cysteine or 
lysine residues and, optionally, substituting another amino acid 
residue not one of methionine, tryptophan, cysteine or lysine. The 
25 opposite substitutions result in diminished oxidation stability. 
The substituted residue is preferably alanyl , but neutral residues 
also are suitable. 

Mutants are provided which exhibit modified pH-activity 
30 profiles. A pH-activity profile is a plot of pH against enzyme 
activity and may be constructed as illustrated in Example 19 or by 
methods known in the art. It may be desired to obtain mutants with 
broader profiles, i.e., those having greater activity at certain pH 
than the precursor, but no significantly greater activity at any pH, 
35 or mutants with sharper profiles, i.e. those having enhanced 
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activity when compared to the precursor at a given pH, and lesser 
activity elsewhere. 

The foregoing mutants preferably are made within the active site 
of the enzyme as these. mutations are most likely to influence 
activity. However, mutants at other sites important for enzyme 
stability or conformation are useful. In the case of Bacillus 
subtil isin or its pre, prepro and pro forms, mutations at tyrosine-1, 
aspartate+32, asparagine+155, tyrosine+104, methionine+222, 
glycine+166, histidine+64, glycine+169, phenyl alanine+189, serine+33, 
serine+221, tyrosine+217, glutamate+156 and/or alanine+152 produce 
mutants having changes in the characteristics described above or in 
the processing of the enzyme. Note that these amino acid position 
numbers are those assigned to 8. amyloliquefaciens subtil isin as 
seen from Fig. 7. It should be understood that a deletion or 
insertion in the N-terminal direction from a given position will 
shift the relative amino acid positions so that a residue will not 
occupy its original or wild type numerical position. Also, allelic 
differences and the variation among various procaryotic species will 
result in positions shifts, so that position 169 in such subtilisins 
will not be occupied by glycine. In such cases the new positions 
for glycine will be considered equivalent to and embraced within the 
designation glycine+169. The new position for glycine+169 is 
readily identified by scanning the subtil isin in question for a 
region homologous to glycine+169 in Fig. 7. 

One or more, ordinarily up to about 10, amino acid residues may 
be mutated. However, there is no limit to the number of mutations 
that are to be made aside from commercial practicality. 

The enzymes herein may be obtained as salts. It is clear that 
the ionization state of a protein will be dependent on the pH of the 
surrounding medium, if it is in solution, or of the solution from 

which it is prepared, if it is in solid form. Acidic proteins are 
commonly prepared as, for example, the ammonium, sodium, or 
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potassium salts; basic proteins as the chlorides, sulfates, or 
phosphates." Accordingly, the present application includes both 
electrically neutral and salt forms of the designated carbony! 
hydrolases, and the term carbony 1 hydrolase refers to the organic 
5 structural backbone regardless of ionization state. 

The mutants are particularly useful in the food processing and 
cleaning arts. The carbony 1 hydrolases, including mutants, are 
produced by fermentation as described herein and recovered by 

10 suitable techniques. See for example K. Anstrup, 1974, Industrial 
Aspects of Biochemistry , ed. B. Spencer pp. 23-46. They are 
formulated with detergents or other surfactants in accord with 
methods known per se for use in industrial processes, especially 
laundry. In the latter case the enzymes are combined with 

15 detergents, builders, bleach and/or fluorescent whitening agents as 
is known in the art for proteolytic enzymes. Suitable detergents 
include linear alkyl benzene sulfonates, alky! ethoxylated sulfate, 
sulfated linear alcohol or ethoxylated linear alcohol. The 
compositions may be formulated in granular or liquid form. See for 

20 example U.S Patents 3,623,957; 4,404,128; 4,381,247; 4,404,115; 
4,318,818; 4,261,868; 4,242,219; 4,142,999; 4,111,855; 4,011,169; 
4,090,973; 3,985,686; 3,790,482; 3,749,671; 3,560,392; 3,558,498; 
and 3,557,002. 

25 The following disclosure is intended to serve as a 

representation of embodiments herein, and should not be construed as 
limiting the scope of this application- 



30 Gl ossary of Experimental in i H ani pul ations 

In order to simplify the Examples certain frequently occurring 
methods will be referenced by shorthand phrases. 
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Plasroids are designated by a small p preceeded and/or followed 
by capital letters and/or numbers. The starting plasiaids herein are 
commercially available, are available on an unrestricted basis, or 
can be constructed from such available plasmids in accord with 
5 published procedures. 

"Klenow treatment" refers to the process of filling a recessed 
3' end of double stranded DMA with deoxy ribonucleotides 
complementary to the nucleotides making up the protruding 5' end of 

10 the DMA strand. This process is usually used to fill in a recessed 
end resulting from a restriction enzyme cleavage of DMA. This 
creates a blunt or flush end, as may be required for further 
ligations. Treatment with Klenow is accomplished by reacting 
{generally for 15 minutes at 15*C) the appropriate complementary 

15 deoxy ribonucleotides with the DMA to be filled in under the 
catalytic activity (usually 10 units) of the Klenow fragment of 
JL* Hli ®NA polymerase I ("Klenow"). Klenow and the other reagents 
needed are commercially available. The procedure has been published 
extensively. See for example T. Maniatis et al., 1982, Molecular 

20 Cloning , pp. 107-108. 

"Digestion" of DMA refers to catalytic cleavage of the DMA with 
an enzyme that acts only at certain locations in the DNA. Such 
enzymes are called restriction enzymes, and the sites for which each 

25 is specific is called a restriction site- "Partial" digestion 
refers to Incomplete digestion by a restriction enzyme, i.e., 
conditions are chosen that result in cleavage of some but not all of 
the sites for a given restriction endonuclease in a DMA substrate. 
The various restriction enzymes used herein are commercially 

30 available and their reaction conditions, cofactors and other 
requirements as established by the enzyme suppliers were used. 
Restriction enzymes commonly are designated by abbreviations 
composed of a capital letter followed by other letters and then, 
generally, a number representing the microorganism from which each 

35 restriction enzyme originally was obtained. In general, about 1 ug 
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of plasmid or DMA fragment is used with about 1 unit of enzyme in 
about 20 til of buffer solution. Appropriate buffers and substrate 
amounts for particular restriction enzymes are specified by the 
manufacturer. Incubation times of about 1 hour at 37°C are 

5 ordinarily used, but may vary in accordance with the supplier's 
instructions. After incubation, protein is removed by extraction 
with phenol and chloroform, and the digested nucleic acid is 
recovered from the aqueous fraction by precipitation with ethanol . 
Digestion with a restriction enzyme infrequently is followed with 

10 bacterial alkaline phosphatase hydrolysis of the terminal 5' 
phosphates to prevent the two restriction cleaved ends of a DMA 
fragment from "circularizing" or forming a closed loop that would 
impede insertion of another DMA fragment at the restriction site. 
Unless otherwise stated, digestion of pi asmids is not followed by 5' 

15 terminal dephosphorylation. Procedures and reagents for 

dephosphorylation are conventional (T. Maniatis et aK, Id., 
pp. 133-134). 

"Recovery" or "isolation" of a given fragment of BNA from a 
20 restriction digest means separation of the digest on 6 percent 

polyacryl amide gel electrophoresis, identification of the fragment 
of interest by molecular weight (using DMA fragments of known 
molecular weight as markers), removal of the gel section containing 
the desired fragment, and separation of the gel from DMA. This 
25 procedure is known generally. For example, see R. Lawn et al_., 
1981, "Nucleic Acids Res." 9:6103-6114, and D. Goeddel et aU, 
{1980) "Nucleic Acids Res." 8:4057. 

"Southern Analysis" is a method by which the presence of DMA 
30 sequences in a digest or DMA-containing composition is confirmed by 
hybridization to a known, labelled oligonucleotide or DMA fragment. 
For the purposes herein. Southern analysis shall mean separation of 
digests on 1 percent agarose and depuri nation as described by 
G. Wahl et al_., 1979, "Proc. Nat. Acad. Sci. U.S.A." 76:3683-3687, 
35 transfer to nitrocellulose by the method of E. Southern, 1975, 
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"J. MoT. Biol." 98:503-517, and hybridization as described by 
T. Maniatis et al_. , 1978, "Cell" 15:687-701. 

"Transformation" means introducing DNA into an organism so that 
5 the DNA is repHcable, either as an extrachromosomal element or 
chromosomal Integrant. Unless otherwise stated, the method used 
herein for transformation of E. col i is the CaClg method of Mandel 
et a]_., 1970, "J. Mol. Biol." 53:154, and for Bacillus, the method 
of Anagnostopolous et aJL, 1961, "J. Bact." 81:791-746. 

10 

"Ligation" refers to the process of forming phosphodiester bonds 
between two double stranded nucleic acid fragments (T. Maniatis 
et aK, Id., p. 146). Unless otherwise stated, ligation was 
accomplished using known buffers and conditions with 10 units of T4 
15 DMA ligase ("ligase") per 0.5 jig of approximately equiraolar amounts 
of the DNA fragments to be li gated. Plasmids from the transformants 
were prepared, analyzed by restriction mapping and/or sequenced by 
the method of Messing, et at,, 1981, "Nucleic Acids Res.", 9:309. 

20 "Preparation" of DNA from transformants means isolating plasmid 
DMA from microbial culture. Unless otherwise stated, the 
alkaline/SDS method of Maniatis et al_., Id. p. 90., was used. 

"Oligonucleotides" are short length single or double stranded 
25 polydeoxynucleo tides which were chemically synthesized by the method 
of Crea et aK, 1980, "Nucleic Acids -Res." 8:2331-2348 (except that 
mesityl ene nitrotHazole was used as a condensing agent) and then 
purified on polyacryl amide gels. 

30 All literature citations are expressly incorporated by reference. 
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Example 1 

Preparation of a Genomic OMA Library from 8. ■ affiyjpli qui fac lens 
and Isolation of its Subtil isin Gene 

The known amino acid sequence of the extracellular 
B. amyloliqu efaciens permits the construction of a suitable probe 
mixture. The sequence of the mature subtil isin is included {along 
with the additional information contributed by the present work] in 
Figure 1. All codon ambiguity for the sequence of amino acids at 
position. 117 through 121 is covered by a pool of eight 

oligonucleotides of the sequence AA{y)AA( j)ATGGA(^)GT. 

Chromosomal DNA isolated from 13. amyloliquefaciens (ATCC No. 
23844) as described by J. Harmur, Hoi, Biol. u , 3:208, was 
partially digested by Sau 3A, and the fragments size selected and 
li gated into the BamH 1 site of dephosphorylated pBS42. (pBS42 is 
shuttle vector containing origins of replication effective both in 
E. coll and Bacillus. It is prepared as described in Example 4.) 
The Sau3A fragment containing vectors were transformed into jE. coli 
K12 strain 294 (ATCC No. 31446) according to the method of M. 
Handel , et aj_., 1970, "J. Hoi. Bio." 53: 154 using 80-400 nanograms 
of library DNA per 250{iL of competent cells. 

Cells from the transformation mixture were plated at a 
density of 1-5 x 10 J transformants per 150mm plate containing LB 
medium + 12.5 pg/ml chloramphenicol, and grown overnight at 37'C 
until visible colonies appeared. The plates were then replica 
plated onto BA85 nitrocellulose filters overlayed on LB/chloram- 
phem'col plates. The replica plates were grown 10-12 hours at 37°C 
and the filters transferred to fresh plates containing LB and 
150 ug/ml spectinomycin to amplify the plasmid pool. 

After overnight incubation at 37*C, filters were processed 
essentially as described by Grunstein and Hogness, 1975, "Proc. 
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Natl. Acad. Sci. (USA)" 72: 3961. Out of approximately 20,000 
successful transforraants, 25 positive colonies were found. Eight of 
these positives were streaked to purify individual clones. 24 
clones from each streak were grown in microtiter wells, stamped on 
s to two replica filters, and probed as described above with either 

AACAA< £ ) ATGGA( £ )GT( pool 1} or AAT_AA{£)ATGGA(^)GT{pool 2) which differ 

by only one nucleotide. As shown in Figure 2, pool 1 hybridized to a 
much greater extent to all positive clones than did pool 2, suggesting 
1Q specific hybridization. 

Four out of five miniplasmid preparations (Maniatis et al . , 
Id.) from positive clones gave identical restriction digest patterns 
when digested with Sau3A or Hindi. The plasmid isolated from one of 
15 these four identical colonies by the method of Maniatis et al_. , Id., 
had the entire correct gene sequence and was designated pS4. The 
characteristics of this plasmid as determined by restriction analysis 
are shown In Figure 3. 

20 

Example 2 
Expression of the Subtil i sin Gene 

25 Bacillus subtil is 1-168 (Catalog No. 1-A1, Bacillus Genetic 

Stock Center) was transformed with pS4 and and a single chloramphenicol 
resistant transformant then grown in minimal medium. After 24 hours, 
the culture was centrifuged and both the supernatant {10-200 pi) and 
pellet assayed for proteolytic activity by measuring the change in 

30 absorbance per minute at 412 nra using 1 ml of the chroraogenic substrate 
succinyl-L-ala-ala-pro-phe-p-nitroanilide {0,2 M H> in Q.IK sodium 
phosphate {pH 8.0} at 2S*C. A B. subtil is 1-168 culture transformed 
with pSS42 used as a control showed less than 1/200 of the activity 
shown by the pS4 transformed culture. Greater than 95 percent of the 

35 protease activity of the pS4 culture was present in the supernatant, 
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and was completely inhibited by treatment with phenyl methyl sul fonyl 
fluoride (PMSF) but not by EDTA. 

Aliquots of the supernatants were treated with PMSF and EDTA to 
5 inhibit all protease activity and analyzed by 12 percent -'SDS-PAGE 

according to the method of Laemmli, U.K., 1970 "Nature", 222: 680. To 
prepare the supernatants, 16 uL of supernatant was treated with ImM 
PMSF, 10 m EDTA for 10 minutes, and boiled with 4 uL of 5x 
concentrated SOS sample buffer minus s-mercaptoethanol . The results 
10 of Coomassie stain on runs using supernatants of cells transformed 
with pS4\ pBS42, and un transformed B. amyloliquefaciens are shown in 
• Figure 4. Lane 3 shows authentic subtil i sin from B_. amyloliquefaciens. 
Lane 2 which is the supernatant from pBS42 transformed 8. subtil is , 
does not give the 31,000 MW band associated with subtil i sin which is 
15 exhibited by Lane 1 from pS4 transformed hosts. The approximately 
31,000 MW band result for subtil isin is characteristic of the slower 
mobility shown by the known M.W. 27,500 subtilisin preparations in 
general . 

20 

Example 3 

Sequencing of the 8. amyloliquefaciens Subtilisin Gene 

25 The entire sequence of an EcoRJ-BaraHI fragment (wherein the EcoRI 
site was constructed by conversion of the Hindi site) of pS4 was 
determined by the method of F. Sanger, 1977, "Proc. Natl. Acad. Sci 
(USA)", 74:5463. Referring to the restriction map shown in Figure 3, 
the BamHI-PvuII fragment was found to hybridize with pool 1 

30 oligonucleotides by Southern analysis. Data obtained from sequencing 
of this fragment directed the sequencing of the remaining fragments 
(e.g. PvuII-HincII and Aval-Aval}. The results are shown in Figure 1. 

Examination of the sequence confirms the presence of codons for 
35 the mature subtilisin corresponding to that secreted by the 
0992Y 
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8. amy] ol i quefaciens « Immediately upstream from this sequence is a 
series of XO? codons beginning with the GTS start codon at -107. 
Codon -107 to approximately codon -75 encodes an amino acid sequence 
whose characteristics correspond to that of known signal sequences* 
5 {Most such signal sequences are 18-30 amino acids in length, have 
hydrophobic cores, and terminate in a small hydrophobic amino acid.) 
Accordingly, examination of the sequence data would indicate that 
codons -107 to approximately -75 encode the signal sequence; the 
remaining intervening codons between -75 and -1 presumably encode a 
10 prosequence. 

Example 4 

15 Construct ion of pBS42 

pBS42 is formed by three-way ligation of fragments derived from 
pUBUO, pC194, and p8R322 {see Figure 5). The fragment from pUBHO is 
the approximately 2500 base pair fragment between the Hpall site at 

20 1900 and the SaraHl site at 4500 and contains an origin of replication 
operable in Bacillus: T. Grycztan, et aK, 1978 Bacteriol.", 134: 
318 {1978); A. Jalanko, et aj_., 1981 "Gene", 14; 325. The BamHI site 
was tested with Klenow. The pBR322 portion is the 1100 base pair 
fragment between the PvuII site at 2067 and the Sau3A site at 3223 

25 which contains the E. coli origin of replication: F. Bolivar, et a]_., 
1977 "Gene", 2: 95; J. Sutcliffe, 1978, Cold Spring Harbor Symposium 
43: I, 77. The pC194 fragment is the 1200 base pair fragment between 
the Hpall site at 973 and the Sau3A site at 2006 which contains the 
gene for chloramphenicol resistance expressible in both coli and IB. 

30 subtil is: S- Ehrlich, "Proc. Natl. Acad. Sci. (USA)", 74:1680; 
S, Horynuchi et aU, 1982, "<J. Bacteriol." 150: 815, 

pBS42 thus contains origins of replication operable both in 
coli and in Bacillus and an expressible gene for chloramphenicol 
35 resistance. 
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Example 5 

Isolation and Sequencing of the 8. subtil is Subtili sin Gene 

5 J.* s u bti l is 1168 chromosomal DM was digested with EcoRI and the 
fragments resolved on gel electrophoresis. A single 6 kb fragment 
hybridized to a [<*~ P3 CTP nick translation - labelled fragment 
obtained from the C- terminus of the subtil i sin structural gene in pS4, 
described above. The 6 kb fragment was electrol uted and 1 i gated into 

10 p8S42 which had been digested with EcoRI and treated with bacterial 
alkaline* phosphatase. £. coli ATCC 31446 was transformed with the 
ligation mixture and transforraants selected by growth on LB agar 
containing 12.5 chloramphenicol /ml . Plasmid DNA was prepared from 
a pooled suspension of 5,000 transformed colonies. This DMA was 

15 transformed into 8_, subtil is BG84, a protease deficient strain, the 
preparation of which is described in Example 8 below. Colonies which 
produced protease were screened by plating on LB agar plus 1.5 percent 
w/w Carnation powdered nonfat skim milk and 5 ug chloramphenicol /ml 
{hereafter termed skim milk selection plates) and observing for zones 

20 of clearance evidencing proteolytic activity. 

Plasmid DNA was prepared from protease producing colonies, 
digested with EcoRI, and examined by Southern analysis for the 
presence of the 6 kb EcoRI insert by hybridization to the 

25 32 P-labelled C- terminus fragment of the subtil i sin structural gene 
from B_. amy! oliquefaciens . A positive clone was identified and the 
plasmid was designated pS168.I. B. subtilis BG84 transformed with 
pS168.1 excreted serine protease at a level 5-fold over that produced 
1n subtilis 1168. Addition of EDTA to the supernatants did not 

30 affect the assay results, but the addition of PMSF 

{phenyl methyl sufonyl fluoride) to the supernatants reduced protease 
activity to levels undetectable in the assay described in Example 8 
for strain 8684. 
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A restriction map of the 6.5 kb EcoRI insert is shown in Fig. 6". 
The subtil i si n gene was localized to within the 2.5 kb KpnI-£coRI 
fragment by subcioning various restriction enzyme digests and testing 
for expression of subtilisin in B_. subtil is 8G84. Southern analysis 
5 with the labelled fragment from the C-terminus of the 

B. amyloliquefaciens subtilisin gene as a probe localized the 
C-terminus of the B_. subtil is gene to within or part of the 631 bp 
Hindi fragment S in the center of this subclone {see Fig, 6). The 
tandem Hindi fragments B, C, and D and HincII-EcoRI fragment £ 

10 (Fig. 6} were li gated into the M13 vectors mp8 or mp9 and sequenced in 
known fashion (0. Messing et aK, 1982, "Gene" 19:209-276) using 
dideoxy chain termination (F. Sanger et al_. , 1977, "Proc. Nat. Acad. 
Sci. U.S.A." 74:5463-5467). The sequence of this region is shown in 
Fig, 7. The first 23 amino acids are believed to be a signal 

15 peptide. The remaining 83 amino acids between the signal sequence and 
the mature coding sequence constitute the putative "pro" sequence. 
The overlined nucleotides at the 3* end of the gene are believed to be 
transcription terminator regions. Two possible Shine-Dai game 
sequences are underlined upstream from the mature start codon. 

20 

Example 6 

Manufacture of an Inacti vating Mutati on of the B. subtil is 
25 Subtilisin Gene 

A two step ligation, shown in Fig. 8, was required to construct a 
plasmid carrying a defective gene which would integrate into the 
Bacillus chromosome. In the first step, pS168.i, which contained the 

30 6.5 kb insert originally recovered from the B_. subtil is genomic 

library as described in Example 5 above, was digested with EcoRI, the 
reaction products treated with Klenow, the DMA digested with Hindi, 
and the 800 bp EcoRI -Hindi fragment £ {see Fig. 6) that contains, in 
part, the 5' end of the 8_. subtil is subtilisin gene, was recovered. 

35 This fragment was H gated into pJHIGl (pJHIOl is available from 
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J. Hoch {Scri pps) and is described by F.A. Ferrari et al», 1983, 
"J. Bact." 134: 318-329) that had been digested with Hindi and treated 
with bacterial alkaline phosphatase. The resultant plasmid, pIDVl, 
contained fragment E in the orientation shown in Fig. 8. In the 

5 second step, pS168.1 was digested with Hindi and the 700 bp Hindi 
fragment B» which contains the 3* end of the subtil isin gene, was 
recovered. pIDVl was digested at its unique Hindi site and 
fragment 8 li gated to the linearized plasmid, transformed in coif 
ATCC 31,446, and selected on LB plates containing 12. S pg 

10 chloramphenicol /ml or 20 ug ampicill in/ml . One resulting plasmid, 
designated pIDV1.4, contained fragment 8 in the correct orientation 
with respect to fragment E. This plasmid pIOVX.4, shown in Fig. 8, is 
& deletion derivative of the subtil isin gene containing portions of 
the 5? and 3' flanking sequences as well. 

15 

jB. subti 1 i s B677, a partial protease-deficient mutant {Prt ' ) 
prepared in Example 8 below was transformed with pIDV1.4. Two classes 
of chloramphenicol resistant (Cm r ) transformants were obtained, 
Seventy-five percent showed the same level of proteases as BG77 

20 {Prt*'™) and 25 percent were almost completely protease deficient 
{Prt"} as observed by relative zones of clearing on plates containing 
LB agar plus skim milk. The Cm r Prt"* trans formants could not be 
due to a single crossover integration of the plasmid at the homologous 
regions for fragment E or B because, in such a case, the gene would be 

25 uninterrupted and the phenotype would be Prt^". In fact, when 

either of fragments E or B were li gated independently into pJHIOl and 
subsequently transformed into 8_. subtflis BG77, the protease deficient 
phenotype was not observed. The Cm r phenotype of Cm r Prt" 
pIDV1.4 transformants was unstable in that Cm s Prt" derivatives 

30 could be Isolated from Cm r Prt" cultures at a frequency of about 
0.1 percent after 10 generations of growth in minimal medium In the 
absence of antibiotic selection. One such derivative was obtained and 
designated BG2018. The deletion was transferred into IA84 (a 8GSC 
strain carrying two auxotrophic mutations flanking the subtilisin gene} 

35 by P8S1 transduction. The derivative organism was designated 862019, 
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Example ? 

Preparation of a Genomic DM library from B. subti Hs and 
iso lation of Its Neutral Pr otease Gene 

5 

The partial amino acid sequence of a neutral protease of 
8, subtil is is disclosed by P. Levy et al_. 1975, "Proc. Mat. Acad. 
Set. USA" 72:4341-4345. A region of the enzyme (Asp Gin Net He Tyr 
Gly3 was selected from this published sequence in which the least 
to redundancy existed in the potential codons for the amino acids in the 
region, * 24 combinations were necessary to cover all the potential 
coding sequences, as described below. 

T 

15 GA J CA jj ATG AT J TA C GG 

Asp Sin Met He Tyr Gly 

Four pools, each containing six alternatives, were prepared as 
20 described above in Example 1. The pools were labelled by 
phosphoryllzation with £t- 32 p] ATP. 

The labelled pool containing sequences conforming closest to a 
unique sequence in a 8. subtil is genome was selected by digesting 

25 B. subtil is (1A72, Bacillus Genetic Stock Center) DM with various 
restriction enzymes, separating the digests on an electrophoresis 
gel, and hybridizing each of the four probe pools to each of the 
blotted digests under increasingly stringent conditions until a 
single band was seen to hybridize. Increasingly stringent 

30 conditions are those which tend to disfavor hybridization, e.g., 
increases in formamide concentration, decreases in salt 
concentration and increases in temperature. At 37 "C in a solution 
of 5x Oenhardt's, 5x SSC, 50 mM NaP0 4 p h 6.8 and 20 percent 
formamide, only pool 4 would hybridize to a blotted digest. These 

35 were selected as the proper hybridization conditions to be used for 
the neutral protease gene and pool 4 was used as the probe. 
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A lambda library of B, subtil is strain BGSC 1-A72 was prepared 
in conventional fashion by partial digestion of the Bacillus genomic 
DMA by Sau3A, separation of the partial digest by molecular weight 
on an electrophoresis gel, elution of 15-20 kb fragments {R. Lawn 
et art., 1981, "Nucleic Acids Res." 9:6103-6114), and ligation of the 
fragments to BamHI digested charon 30 phage using a Packagene kit 
from Promega Biotec. 

E. coli DPSOsupF was used as the host for the phage library, 
although any known host for Charon lambda phage is satisfactory. 
The E. col i host was plated with the library phage and cultured, 
after which plaques were assayed for the presence of the neutral 
protease gene by transfer to nitrocellulose and screening with probe 
pool 4 (Benton and Davis, 1977, "Science" 196:180-182). Positive 
plaques were purified through two rounds of single plaque 
purification, and two plaques were chosen for further study, 
designated xNPRGl and xNPRG2. OKA was prepared from each phage by 
restriction enzyme hydrolysis and separation on electrophoresis 
gels. The separated fragments were blotted and hybridized to 
labelled pool 4 oligonucleotides. This disclosed that xNPRGl 
contained a 2400 bp Hindi II hybridizing fragment, but no 4300 EcoRI 
fragment, while xNPRG2 contained a 4300 bp EcoRI fragment, but no 
2400 bp Hind III fragment. 

The 2400 bp xNPRGl fragment was subcloned into the Hindlll site 
of pOHlOl by the following method. xNPRGl was digested by Hindlll, 
the digest fractionated by electrophoresis and the 2400 bp fragment 
recovered from the gel. The fragment was 11 gated to alkaline 
phosphatase- treated Hindlll digested pJHIOl and the ligation mixture 
used to transform E_. coli ATCC 31446 by the calcium chloride shock 
method of V. Hershfield et aT 1974, "Proc. Mat. Acad. Sci. 
(U.S.A.)" 79:3455-3459). Trans forraants were identified by selecting 
colonies capable of growth on plates containing LB medium plus 
12.5 fjg chloramphenicol /ml . 
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Transforaant colonies yielded several plasmids. The orientation 
of the 2400 bp fragment in each plasmid was determined by 
conventional restriction analysis {orientation is the sense reading 
or transcriptional direction of the gene fragment in relation to the 
5 reading direction of the expression vector into which it is 

1i gated.} Two plasmids with opposite orientations were obtained and 
designated pNPRsubH6 and pNPRsubHl. 

The 4300 bp EcoRI fragment of xNPRG2 was subcloned into pBR325 
•JO by the method described above for the 2400 bp fragment except that 
ANPRG2 was digested with EcoRI and the plasmid was alkaline 
phosphatase- treated, EcoRI -digested pBR325. pBR325 is described by 
F. Bolivar, 1978, "Gene" 4:121-136. Two plasmids were identified in 
which, the 4300 bp insert was present in different orientations. 
ig These two plasmids were designated pNPRsubRI and pNPRsubRlb. 

Example 8 

20 Character ization of B. subtil is Neutral Protease Gene 

The pNPRsubHl insert was sequentially digested with different 
restriction endonucleases and blot hybridized with labelled pool 4 
in order to prepare a restriction map of the insert {for general 

2g procedures of restriction mapping see T. Maniatis et aj[. , Id., 
p. 377). A 430 bp Rsal fragment was the smallest fragment that 
hybridized to probe pool 4. The Rsal fragment was 11 gated into the 
Smal site of M13 mp8 (J. Messing et ajU, 1982, "Gene" 19:269-276 and 
J. Messing In Methods in Enzymology , 1983, R. Wu et aj_. , Eds,, 

30 101:20-78) and the sequence determined by the chain-terminating 
dideoxy method {F. Sanger ejt ajk, 1977, M Proc Nat, Acad, Sci, 
U.S.A." 74:5463-5467). Other restriction fragments from the 
pNPRsubHl insert were ligated into appropriate sites in M13 mp8 or 
M13 mp9 vectors and the sequences determined. As required, dITP was 

35 used to reduce compression artifacts (0, Mills et al_. , 1979, "Proc. 
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Mat. Acad. Sci. (U.S.A.)" 76:2232-2235). The restriction map for 
the pfiPRsubHI fragment is shown in Fig. 9. The sequences of the 
various fragments from restriction enzyme digests were compared and 
an open reading frame spanning a codon sequence translatable into 
5 the amino and carboxyl terrain! of the protease (P. Levy et aK, Id.} 
was determined. An open reading frame is a QUA sequence commencing 
at a known point which in reading frame (every three nucleotides) 
does not contain any internal termination codons. The open reading 
frame extended past the amino terminus to the end of the 2400 bp 

10 Hindi II fragment. The 1300 bp Bglll - Hindi II fragment was prepared 
from pNPRsubRIb (which contained the 4300 bp EcoRI fragment of 
X.MPRG2) and cloned in M13 mp8. The sequence of this fragment, which 
contained the portion of the neutral protease leader region not 
encoded by the 2400 bp fragment of pNPRsubHl, was determined for 400 

15 nucleotides upstream from the Hindi II site. 

The entire nucleotide sequence as determined for this neutral 
protease gene., including the putative secretory leader and prepro 
sequence, are shown in Fig. 10. The numbers above the line refer to 

20 amino acid positions. The underlined nucleotides in Fig. 10 are 
believed to constitute the ribosome binding { Shine-Dai garno) site, 
while the overlined nucleotides constitute a potential hairpin 
structure presumed to be a terminator. The first 27 - 28 of the 
deduced amino acids are believed to be the signal for the neutral 

25 protease, with a cleavage point at ala-27 or ala-28. The "pro" 
sequence of a proenzyme structure extends to the ami no-terminal 
amino acid (ala-222) of the mature, active enzyme. 

A high copy plasmid carrying the entire neutral protease gene 
30 was constructed by (Fig. 11) li gating the Bglll fragment of 

pNPRsubRl , which contains 1900 bp (Fig. 9), with the PvuII - Hindi I I 
fragment of pNPRsubHl, which contains 1400 bp . pBS42 (from 
Example 4} was digested with SamHI and treated with bacterial 
alkaline phosphatase to prevent plasmid recircularization. 
35 pUPRsubRl was digested with Bglll, the 1900 bp fragment was Isolated 
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from gel electrophoresis and 11 gated to the open Bamlil sites of 
P8S42. The If gated plasmid was used to transform E. coil ATCC 31446 
by the calcium chloride shock method {V. Hershfield et aT. , Id.), 
and transformed cells selected by growth on plates containing L8 

5 medium with 12.5 ng/ml chloramphenicol. A plasmid having the 8gl II 
fragment in the orientation shown in Fig. 11 was isolated from the 
transformants and designated pNPRsubBl. pNPRsubBl was digested 
(linearized) with EcoRI, repaired to flush ends by Kl enow treatment 
and then digested with Hindlll. The larger fragment from the 

10 Hindlll digestion {containing the sequence coding for the amino 
terminal "and upstream regions) was recovered. 

The carboxyl terminal region of the gene was supplied by a 
fragment from pNPRsubHl, obtained by digestion of pNPRsubHl with 

15 PvuII and Hindlll and recov&ry of the 1400 bp fragment. The flush 
end PvuII and the Hindlll site of the 1400 bp fragment was li gated, 
respectively, to the blunted EcoRI and the Hindlll site of ' 
pNPRsubBl, as shown in Fig. 11. This construct was used to 
transform B. subtil is strain 8684 which otherwise excreted no 

20 proteolytic activity by the assays described below. Transformants 
were selected on plates containing LB medium plus 1.5 percent 
carnation powdered nonfat milk and 5ag/ial chloramphenicol . Plasmids 
from colonies that cleared a large halo were analyzed. Plasmid 
PNPRIQ, incorporating the structural gene and flanking regions of 

25 the neutral protease gene, was determined by restriction analysis to 
have the structure shown in Fig. 11. . 

£• subtil is strain BG84 was produced by N~methyl~N'-nitro-N- 
nttrosoguanidine (NTG) mutagenesis of B_. subtil is 1168 according to 

30 the general technique of Adelberg et aj_., 1965, "Biochem. Biophys. 
Res. Coramun." 18:788-795. Mutagenized strain 1168 was plated on 
skim milk plates {without antibiotic). Colonies producing a smaller 
halo were picked for further analysis. Each colony was 
characterized for protease production on skim milk plates and 

35 amylase production on starch plates. One such isolate, which was 
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partially protease deficient, amylase positive and capable of 
sports] a Hon, was designated BS77. The protease deficiency mutation 
was designated prt-77. The prt-77 allele was moved to a spoOA 
background by congression as described below to produce strain BG84, 
5 a spoliation deficient strain. 



Table A 



10 Strain Relevant Genotype 



origin 



15 



H68 trpC2 

JH703 trpC2, pheA12 , spoOAA677 

BG16 ' purB6, metBS, leuAS, lys-21 , hisA, thr-5 

sacA321 

BG77 trpC2 s prt-77 

BG81 metBS , prt-77 

20 BG84 spo0a677, prt-77 



Trousdale e t a1. a 
Pb 1665 

NTG x 1168 
8G16 DMA X BG77 
JH703 DHA x BG81 



a «ho1. Gen. Genetics" 173:61 (1979) 



25 BG84 was completely devoid of protease activity on skim milk 
plates and does not produce detectable levels of either subtilisin 
or neutral protease when assayed by measuring the change in 
absorbance at 412 nm per minute upon incubation with 0.2 ng/ml 
succinyl (-L-ala-L-ala-L-pro-L-phe) p-nitroanilide (Vega) in 0.1 M 

30 sodium phosphate, pH 8, at 25°C. BG84 was deposited in the ATCC as 
deposit number 39382 on July 21, 1983. Samples for subtilisin assay 
were taken from late logarithmic growth phase supernatants of 
cultures grown in modified Schaeffer's medium (T. Leighton et aT., 
1971, B J. Biol. Chem." 246:3189-3195). 

35 
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Example 9 

Expression of the Neutral' Protease Gene . 

BG84 transformed with pNPRlO was inoculated into minimal media 
supplemented with 0.1 percent casein hydrolysate and 10 ng 
chloramphenicol and cultured for 16 hours. O.I ml of culture 
supernatant was removed and added to a suspension of 1.4 mg/ml 
Azocoll proteolytic substrate {Sigma} in 10 mM THs-HCl , 100 nM-NaCI . 
pH $.8 and incubating with agitation. Undigested substrate was 
removed -by centrifugation and the optical density read at 505 rtm. 
Background values of an Azocoll substrate suspension were 
subtracted. The amount of protease excreted by a standard 
pro tease-expressing strain, 8G16 was used to establish an arbitrary 
level of 100. The results with 8G16, and with BG84 transformed with 
control and neutral protease gene-containing plasmids are shown in 
Table 8 in Example 12 below. Transformation of the excreted 
protease-devoid B. subtil is strain 8684 results in excretion of 
protease activity at considerably greater levels than in BG16, the 
wild- type strain. 

Example 10 

Manufacture of a n Inactivating Mutation of the Meutral Protease 
Gene 



The two Rsal bounded regions in the 2400 bp insert of pNPRsubHl, 
totalling 527 bp, can be deleted in order to produce an incomplete 
structural gene. The translational products of this gene are 
enzymatically inactive. A plasmid having this deletion was 
constructed as follows. pJHIOl was cleaved by digestion with 
Hindi II and treated with bacterial alkaline phosphatase. The 
fragments of the neutral protease gene to be incorporated into 
linearized pJHIOl were obtained by digesting pNPRsubHl with Hindi 1 1 
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and Rsal, and recovering the 1200 bp Hindlll-Rsal and 680 bp 
Rsal-HindllT fragments by gel electrophoresis. These fragments were 
11 gated into linearized pOHlOl and used to transform IE. coli 
ATCC 31446. Trans formants were selected on plates containing LB 
5 medium and 20 pg ampicill in/ml. Plasmids were recovered from the 
transformants and assayed by restriction enzyme analysis to identify 
a plasmid having the two fragments in the same orientation as in the 
pNPRsubHl starting plasmid. The plasmid lacking the internal Rsal 
fragments was designated pNPftsubHU. 

10 

Exampl e 11 

Re placement of the N eutral Protease Gene with a Deletion Mutant 

15 

Plasmid pHPRsubhU was transformed into subtil i s strain 
8G20X9 {the subtilisin deleted mutant from Example 6) and 
chromosomal integrants were selected on skim milk plates. Two types 
of Cm r transformants were noted, those with parental levels of 

20 proteolysis surrounding the colony, and those with almost no zone of 
proteolysis. Those lacking a zone of proteolysis were picked, 
restreaked to purify individual colonies, and their protease 
deficient character on skim milk plates confirmed. One of the 
Cm r , proteolysis deficient colonies was chosen for further studies 

25 {designated BG2G34). Spontaneous Cm 5 revertants of BG2034 were 
isolated by overnight growth in LB media containing no Cm, plating 
for individual colonies, and replica plating on media with and 
without Cm. Three Cm s revertants were isolated, two of which were 
protease proficient, one of which was protease deficient (designated 

30 BG2036). Hybridization analysis of BG2036 confirmed that the 

plasmid had been lost from this strain, probably by recombination, 
leaving only the deletion fragments of subtilisin and neutral 
protease. 

35 
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Example 12 

Phenotype of St rain s Lacking Functional Subtil i sin and Meutral 
Protease 

The growth, sporulation and expression of proteases was examined 
in strains lacking a functional gene for either the neutral or 
alkaline protease or both. The expression of proteases was examined 
by a zone of clearing surrounding a colony on a skim milk plate and 
by measurement of the protease levels in liquid culture supernatants 
{Table B). A strain (BG2035) carrying the subtil i sin gene deletion, 
and showed a 30 percent reduction level of protease activity and a 
normal halo on milk plates. Strain BG2043, carrying the deleted 
neutral protease gene and active subtil isin gene, and constructed by 
transforming BS16 {Ex. 8) with DMA from BG2036 (Example 11), showed 
an 80 percent reduction in protease activity and only a small halo 
on the milk plate. Strain BG2054, considered equivalent to BG2036 

Table B 

Effect of protease deletions on protease expression and sporulation. 



Genotype 3 Protease activity* * Percent Sporulation 

BG16 Wild type 100 40 

BG2035 aprA684 70 20 

BG2043 nprEA522 20 20 

BG2054 aprA684,nprE&522 NO 45 

BG84(pBS42) spoOA&677,prt~77 NO 

8G84(pNPR10) spoOAA677,prt~77 3000 

a 0nly the loci relevant to the protease pheno type are shown. 
^Protease activity is espressed in arbitrary units, BG16 was assigned a 
level of 100. MD indicates the level of protease was not detectable in 

the assay used- " . . . • 
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{Example 11 > in that it carried the foregoing deletions in both 
genes, showed no detectable protease activity in this assay and no 
detectable halo on milk plates. The deletion of either or both of 
the protease genes had no apparent effect on either growth or 
5 spoliation. Strains carrying these deletions had normal growth 
rates on both minimal glucose and LB media. The strains sporulated 
at frequencies comparable to the parent strain BG16. Examination of 
morphology of these strains showed no apparent differences from 
strains without such deletions. 

10 

Example 13 

Site-specific Saturation Mutagenesis of the B. Amyloliquefaciens 
Subtil 1 sin Gene at Position 222; Preparation of the Gene for 
15 Cassette Insertion 

pS4-5, a derivative of pS4 made according to Wells et a]L» 
"Nucleic Acids Res.% 1983, 11:7911-7924 was digested with EcoRI and 
BatnHI, and the 1.5 kb EeoRJ-BamHl fragment recovered. This fragment 

20 was ligated into replicative form M-13 mp9 which had been digested 
with EcoRI and BamHI (Sanger et aJU, 1980, M. Mol . Biol." 143 
161-178. Messing et aj_, 1981, "Nucleic Acids Research" 9, 304-321. 
Messing, J. and Vieira, J. {1982} Gene 19, 269-276). The M-13 mp9 
phage ligations, designated M-13 mp9 SUBT, were used to transform 

25 E. coli strain JM101 and single stranded phage DNA was prepared from 
a two mL overnight culture. An oligonucleotide primer was 
synthesized having the sequence 

5 ' -GTACAACGGTACCTCACGCACGCTGCAGGAGCGGCTGC-3 ' . This primer conforms 
to the sequence of the subtil is gene fragment encoding amino acids 

30 216-232 except that the 10 bp of codons for amino acids 222-225 were 
deleted, and the codons for amino acids 220, 227 and 228 were 
mutated to introduce a Kpnl site 5' to the raet-222 codon and a PstI 
site 3' to the met+222 codon. See Fig. 12. Substituted nucleotides 
are denoted by asterisks, the underlined codons in line 2 represent 

35 the new restriction sites and the scored sequence in line 4 
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represents the inserted oligonucleotides. The primer (about 15 uM} 
was labelled with [ 32 p] by incubation with Ct 32 p3-ATP (10 #1 in 
20 ul reaction) {Amersham 5000 Ci/mmol, 102183 and T 4 
polynucleotide kinase (10 units) followed by non-radioactive ATP 
g {100 (iH) to allow complete phosphorylation of the mutagenesis 

primer. The kinase was inactivated by heating the phosphorylation 
mixture at 68"C for 15 rain. 

The primer was hybridized to M-13 rap9 SU8T as modified from 
Norris et al_., 1983, "Nucleic Acids Res." 11, 5103-5112 by combining 
5 jiL of the labelled mutagenesis primer ("3 yM) , ~1 y g M-13 mp9 SU8T 
template, 1 ul of 1 yM M-13 sequencing primer (17-mer), and 2.5 til 
of buffer (0.3 M Tris pH 8, 40 mM MgC! 2 , 12 mM EDTA, 10 mM DTT, 
0.5 mg/inl BSA). The mixture was heated to 68°C for 10 minutes and 
cooled 10 minutes at room temperature. To the annealing mixture was 
added 3.6 »l of 0.25 mN dGTP, dCTP, dATP, and dTTP, 1.25 »l of 10 mM 
ATP, 1 ul ligase {4 units) and 1 »L Klenow {5 units). The primer 
extension and ligation reaction {total volume 25 w l) proceeded 
2 hours at 14*C. The Klenow and ligase were inactivated by heating 
to 68*C for 20 min. The heated reaction mixture was digested with 
8amHl and EcoRI and an aliquot of the digest was applied to a 6 
percent polyacryl amide gel and radioactive fragments were visualized 
by autoradiography. This showed the [ 32 P] mutagenesis primer had 
indeed been incorporated into the EcoRI-BamHl fragment containing 
the now mutated subtilisin gene. 

The remainder of the digested reaction mixture was diluted to 
200 (iL with 10 mM Tris, pH 8, containing 1 mM EDTA, extracted once 
with a 1:1 (v:v) phenol/chloroform mixture, then once with 
30 chloroform, and the aqueous phase recovered. 15 nL of 5M ammonium 
acetate CpH 8) was added along with two volumes of ethanol to 
precipitate the DNA from the aqueous phase. The DMA was pelleted by 
centrifugation for five minutes in a microfuge and the supernatant 
was discarded. 300 uL of 70 percent ethanol was added to wash the 
35 DMA pellet, the wash was discarded and the pellet lyophil ized. 
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p8S42 from example 4 above was digested with BamHl and EcoRI and 
purified on an acryl amide gel to recover the vector. G.Sng of the 
digested vector, SQpM ATP and 6 units ligase were dissolved in 20 yl 
of ligation buffer. The ligation went overnight at 14°C. The DNA 
was transformed into I. coli 294 rec + and the transformants grown 
in 4 ml of LB medium containing 12.5 ng/ml chloramphenicol . Flasmid 
DNA was prepared from this culture and digested with Kpnl, EcoRI and 
BamHI. Analysis of the restriction fragments showed 30-50 percent 
of the molecules contained the expected Kpnl site programmed by the " 
mutagenesis primer. It was hypothesized that the plasraid population 
not including the Kpnl site resulted from M-13 replication before 
bacterial repair of the mutagenesis site, thus producing a 
heterogenous population of Kpnl + and Kpnl" plasmids in some of 
the transformants. In order to obtain a pure culture of the Kpnl 
plasmid, the DNA was transformed a second time into E.. coli to clone 
plasmids containing the new Kpnl site. DNA was prepared from 16 
such transformants and six were found to contain the expected Kpnl 
site. 

Preparative amounts of DNA were made from one of these six 
transformants (designated p&222) and restriction analysis confirmed 
the presence and location of the expected Kpnl and PstI sites. 40 
ng of pa222 were digested in 300 vl of Kpnl buffer plus 30 p.1 Kpnl 
(300 units) for 1.5 h at 3?'C. The DNA was precipitated with 
ethanol , washed with 70 percent ethanol , and lyophilized. The DNA 
pellet was taken up in 200 »L Hindlll buffer and digested with 20 nL 
(500 units) PstI for 1.5 h at 37*0. The aqueous phase was extracted 
with phenol/CHC! 3 and the DMA precipitated with ethanol. The DNA 
was dissolved in water and purified by polyacryl amide gel 
electrophoresis. Following electroelution of the vector band {120 v 
for 2 h at 0°C in 0.1 times THE (Maniatis et al_.» Id.)} the DNA was 
purified by phenol /CHC1 3 extraction* ethanol precipitation and 
ethanol washing. 
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Although pA222 could be digested to completion (>98 percent) by 
either Knpl or PstI separately, exhaustive double digestion was 
incomplete {«50 percent). This may have resulted from the fact 
that these sites were so close {10 bp) that digestion by Knpl 
allowed "breathing" of the DNA in the vicinity of the PstI site, 
i.e., strand separation or fraying. Since PstI will only cleave 
double stranded DM, strand separation could inhibit subsequent PstI 
digestion. 

Example 14 

ligation of Oligonucleotide Casettes into the Subtil i sin Gene 

10 uH of four complementary oligonucleotide pools (A-D, Table 1 
below) which were not 5* phosphorylated were annealed in 20 til 
ligase buffer by heating for five minutes at 68*C and then cooling 
for fifteen minutes at room temperature. 1 \M of each annealed 
oligonucleotide pool, "0.2 Kpnl and Pstl-digested p&222 obtained 
in Example 13, 0.5 mH ATP, ligase buffer and 6 units T^ DNA ligase 
in 20 jiL total volume was reacted overnight at 14°C to ligate the 
pooled cassettes in the vector. A large excess of cassettes ( ~300x 
over the p&222 ends) was used in the ligation to help prevent 
intramolecular Kpnl-Kpnl ligation. The reaction was diluted by 
adding 25 al of 10 m Tris pH 8 containing 1 mH EDTA. The mixture 
was reannealed to avoid possible cassette concatemer formation by 
heating to 68'C for five minutes and cooling for 15 minutes at room 
temperature. The ligation mixtures from each pool were transformed , 
separately into E. coif 294 rec 4 cells. A small aliquot from each 
transformation mixture was plated to determine the number of 
independent transformants. The large number of transformants 
indicated a high probability of multiple mutagenesis. The rest of 
the transformants ("200-400 transformants) were cultured in 4 ml of 
LB medium plus 12.5 pg chloramphenicol /ml . DMA was prepared from 
each transformation pool (A-D). This DNA was digested with Kpnl, 
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"0.1 iig was used to retransferro E. c o l i rec* and the mixture was 
plated to isolate individual colonies from each pool. Ligation of 
the cassettes into the gene and bacterial repair upon transformation 
destroyed the Kpnl and PstI sites. Thus, only pa222 was cut when 
the transformant DNA was digested with Kpnl. The cut plasmid would 
not transform E. coli . Individual transforaants were grown In 
culture and DMA was prepared from 24 to 26 transformants per pool 
for direct plasmid sequencing. A synthetic oligonucleotide primer 
having the sequence 5 ' -GAGCTTGATGTCATGGC-3 ' was used to prime the 
dideoxy sequencing reaction. The mutants which were obtained are 
described in Table C below. 

Two codon+222 mutants (i.e., gin and ile) were not found after 
the screening described. To obtain these a single 25mer 
oligonucleotide was synthesized for each mutant corresponding to the 
top oligonucleotide strand in Figure 12. Each was phosphorylated 
and annealed to the bottom strand of its respective 
nonphosphorylated oligonucleotide pool (i.e., pool A for gin and 
pool D for ile). This was ligated into Kpnl and PstI digested pA222 
and processed as described for the original oligonucleotide pools. 
The frequency of appearance for single mutants obtained this way was 
2/8 and 0/7 for gin and ile, respectively. To avoid this apparent 
bias the top strand was phosphorylated and annealed to its 
unphosphorylated complementary pool. The heterophosphorylated 
cassette was ligated into cut p&222 and processed as before. The 
frequency of appearance of gin and i.le mutants was now 7/7 and 7/7, 
respectively. 

The data in Table C demonstrate a bias in the frequency of 
mutants obtained from the pools. This probably resulted from 
unequal representation of oligonucleotides in the pool . This may 
have been caused by unequal coupling of the particular trimers over 
the mutagenesis codon in the pool. Such a bias problem could be 
remedied by appropriate adjustment of trimer levels during synthesis 
to reflect equal reaction. In any case, mutants which were not 
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isolated in the primary screen were obtained by synthesizing a 
single strand oligonucleotide representing the desired mutation, 
phospharylating both ends, annealing to the pool of 
non-phosphorylated complementary strands and ligating into the 

s cassette site, A biased heteroduplex repair observed for the 

completely unphosphorylated cassette may result from the fact that 
position 222 is closer to the 5* end of the upper strand than it is 
to the 5' end of the lower strand {see Figure 12). Because a gap 
exists at the unphosphorylated 5* ends and the mismatch bubble in 

1£} the double stranded DMA is at position 222, excision repair of the 
top strand gap would more readily maintain a circularly hybridized 
duplex capable of replication. Consistent with this hypothesis is 
the fact that the top strand could be completely retained by 
selective 5' phosphorylation. In this case only the bottom strand 

15 contained a 5* gap which could promote excision repair. This method 
is useful in directing biased incorporation of synthetic 
oligonuclotide strands when employing mutagenic oligonucleotide 
cassettes. 



20 

Example IS 

Site-Specific Mutagenesis of the Subtil isin Gene at Po sition 166 

2g The procedure of Examples 13-14 was followed in substantial 
detail, except that the mutagenesis primer differed {the 37 mer 
shown in Fig. 13 was used), the two restriction enzymes were Sac I 
and Xmalll rather than PstI and Kpnl and the resulting constructions 
differed, as shown in Fig. 13. 

30 

Bacillus strains excreting mutant subtilisins at position 166 
were obtained as described below in Example 16. The mutant 
subtilisins exhibiting substitutions of ala, asp, gin, phe* his, 
lys, asn, arg, and val for the wild-type residue were recovered. 
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Example 16 

Preparation of Mutant Subtilisin Enzymes 

I* subtnis strain BG2036 obtained by the method of Example 11 
was transformed by the plasraids of Examples 14, IS or 20 and by 
pS4-5 as a control. Transformants were plated or cultured in shaker 
flasks for 16 to 48 h at 37*C In LB media plus 12.5 ug/ml 
chloramphenicol. Mutant enzymatieally active subtil i sin was 
recovered by dialyzlng cell broth against 0.01M sodium phosphate 
buffer, pH 6.2. The dialyzed broth was then titrated to pH 6.2 with 
IN HC1 and loaded on a 2.5 x 2 cm column of CM cellulose (CM- 52 
Whatman). After washing with 0.01M sodium phosphate, pH 6.2, the 
subtilisins (except mutants at position +222) were eluted with the 
same buffer made 0.08M in NaCI . The mutant subtilisins at position 
+222 were each eluted with 0.1M sodium phosphate, pH 7.0. The 
purified mutant and wild type enzymes were then used in studies of 
oxidation stability, Km, Kcat, Kcat/Km ratio, pH optimum, and 
changes in substrate specificity. 




35 
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Table C Oligonucleotide Pool Organization 
and Frequency of Mutants Obtained 



Pool 


Amino Acids Codon-222a 


Frequency* 5 


A 


asp 


GAT 


2/25 




met 


ATS 


*> toe 




cys 


T6T 


13/25 




arg 


AGA 


2/25 




gin 


GAA 


0/25 




unexpected mutantsa 




5/25 


8 


leu 


CTT 


1/25 


- 


pro 


CCT 


3/25 




phe 


TTC 


6/25 




tyr 


TAG 


5/25 




his 


CAC 


1/25 




unexpected mutants 




9/25 


C 




GAA 


3/17 




ala 


GCT 


3/17 




thr 


ACA 


1/17 




lys 




1/17 




asn 


AAC 


1/17 




unexpected mutants 




8/17 


D 


gly 


GGC 


1/23 




trp 


TGG 


8/23 




ile 


ATC 


0/23 




ser 


A6C 


1/23 




val 


GTT 


4/23 




unexpected mutants 




9/23 



Codons were chosen based on frequent use in the cloned 
subtil i sin gene sequence (Wells et al . , 1983, Id.). 

Frequency was determined from single track analysis by direct 
plasmid sequencing. 

Unexpected mutants generally comprised double mutants with 
changes in codons next to 222 or at the points of ligation. 
These were believed to result from impurities in the 
obigonucleotide pools and/or erroneous repair of the gapped 
ends. 
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Example 17 

Mutant Subtilisin Exhibiting Improved Oxidation Stability 

Subtil i sins having cysteine and alanine substituted at the 222 
position for wild-type methionine {Example 16} were assayed for 
resistance to oxidation by incubating with various concentrations of 
sodium hypochloride (Clcrox Bleach). 

To a total volume of 400 ul of 0.1M, pH 7, NaP0 4 buffer 
containing the indicated bleach concentrations {Fig. 14) sufficient 
enzyme was added to give a final concentration of 0.016 rag/ml of 
enzyme. The solutions were incubated at 25*C for 10 rain, and 
assayed for enzyme activity as follows: 120 ul of either al a+222 or 
wild type, or 100 ul of the cys+222 incubation mixture was combined 
with 890 ul 0.1M tHs buffer at pH 8,6 and 10 ul of a sAAPFpfci 
{Example 18} substrate solution {20 mg/ral in DMS0). The rate of 
increase in absorbance at 410 nm due to release of p-nitroaniline 
{Del Mar, E.G., et al.., 1979 "Anal. Biochean" 9S, 316-320) was 
monitored. The results are shown in Fig. 14. The alanine 
substitution produced considerably more stable enzyme than either 
the wild-type enzyme or a mutant in which a labile cysteine residue 
was substituted for methionine. Surprisingly, the alanine 
substitution did not substantially interfere with enzyme activity 
against the assay substrate, yet conferred relative oxidation 
stability on the enzyme. The serin'e+222 mutant also exhibited 
improved oxidation stability. 

Example 18 

Mutant Subtil i sins Exhibiting Modified Kinetics and Substrate 
Specificity 

Various mutants for glycine+166 were screened for modified 
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Kcat, Km and Kcat/Km ratios. Kinetic parameters were obtained by 
analysis of the progress curves of the reactions. The rate of 
recti on was measured as a function of substrate concentration. Data 
was analyzed by fitting to the Michael is -Menton equation using the 
5 non-linear regression algorithm of Marquardt {Marquardt, D, tf. 1963, 
M. Soc. Ind. Appl. Math." 11, 431-41). All reactions were 
conducted at 25°C in 0.1H tris buffer, pH 8-6, containing 
benzoyl -L-Yalyl -61 ycyl -L-Arginyl -p-ni troani 1 i de [BYGRpM; Vega 
Biochemical s) at initial concentrations of 0.0025 M to 0.00026 M 
10 {depending on the value of Km for the enzyme of interest - 

concentrations were adjusted in each measurement so as to exceed Km) 
or succ inyl -L-Al any! -l-Al anyl -UProlyl -L-Phenyl al anyl -p-ni tro- 
ani lide (sAAPFpN; Vega Biochemical s) at initial concentrations of 
0.0010 M to 0.00028 H (varying as described for BVGRpN). 

15 

The results obtained in these experiments were as follows: 

Table D 



20 Substrate Enzyme Kcat Cs-*? Km W) Kcat/Km 





sAAPFpN 


gly-166(wild type) 


37 


1.4xl0~ 4 


3 x 10 5 






ala+166 


19 


2.7xKT 5 


7 x 10 S 






asp+166 


3 


5.8xl0" 4 


5 x 10 3 


25 




glu+166 


11 


3.4xl0~ 4 


3 X 10 4 






phe+166 


3 


1.4xl0" 5 


2 x 10 5 






hys+166 


15 


1.U10" 4 


1 x 10 5 






lys+166 


15 


3,4xl0" 5 


4 x 10 5 






asn+166 


26 


1.4xl0~ 4 


2 x 10 5 


30 




arg+165 


19 


6.2xl0" 5 


3 x 10 5 






val+166 


1 


1.4xl0" 4 


1 x 10 4 




BVGRpN 


Wild Type 


2 


l.lxlO" 3 


2 x I0 3 






asp+166 


2 


4.1xl0" 5 


5 x 10 4 






glu+166 


2 


2.7xl0~ 5 


7 x 10 4 


35 




asn+166 


1 


1.2xlG~ 4 


8 x 10 3 
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The Kcat/Km ratio for each of the mutants varied from that of 
the wild-type enzyme. As a measure of catalytic efficiency, these 
ratios demonstrate that enzymes having much higher activity against 
a given substrate can be readily designed and selected by screening 
5 in accordance with the invention herein. For example, A166 exhibits 
over 2 times the activity of the wild type on sAAPFpH. 

This data also demonstrates changes in substrate specificity 
upon mutation of the wild type enzyme. For example, the Kcat/Km 
ratio for the 0166 and E166 mutants is higher than the wild type 
enzyme with the BVGpN substrate, but qualitatively opposite results 
were obtained upon incubation with sAAPFpN . Accordingly, the D166 
and E166 mutants were relatively more specific for BYSRpfi than for 
sAAPFpN, 

Example 19 

Mutant Subtil i sin Exhibiting Modified pK -Activity Profile 

The pH profile of the Cys+222 mutant obtained in Example 16 was 
compared to that of the wild type enzyme. 10yl of 60 mg/ml sAAPFpN 
in DMSO, 10 iil of Cys+222 (0.18 mg/ml ) or wild type (0.5 mg/ml) and 
980 pi of buffer {for measurements at pH 6.6, 7.0 and 7.6, 0.1H 
NaPG 4 buffer; at pH 8.2, 8.6 and 9.2, 0.1M tris buffer; and at pH 
9.6 and 10.0, 0.1M glycine buffer), after which the initial rate of 
change in absorbance at 410 nm per minute was measured at each pH 
and the data plotted in Fig. 15. The Cys+222 mutant exhibits a 
sharper pH optimum than the wild type enzyme. 

30 Example 20 

Site-Specific Mutag enesis of Subtf Ifstn G ene at Position 169 

The procedure of Examples 13-14 was followed in substantial 
35 detail, except that the mutagenesis primer differed {the primer 

0992V 



-si- 0130756 



shown in Fig. 16 was used), the two restriction engines were Kpnl 
and EcoRV rather than PstI and Kpnl and the resulting constructions 
differed, as shown in Fig. 16. 

5 Bacillus strains excreting mutant subtil i sins at position 169 

were obtained as described below in Example 16. The mutant 
subtil i sins exhibiting substitutions of ala and ser for the 
wild-type residue were recovered and assayed for changes in kinetic 
features. The assay employed SAAPFpJJ at pH 8.6 in the same fashion 
10 as set forth in Example 18. The results were as follows: 

Table E 

Enzyme Kcat (s" 1 ) Km (M) Kcat/Km 

ala+169 58 7.5 x 10" 5 8 x 10 5 

15 ser+169 38 8.5 x 10" 5 4 x 10 5 



20 



30 



Example 21 

Alterations in Specific Activity on a Protein Substrate 



Position 166 mutants from Examples 15 and 16 were assayed for 
alteration of specific activity on a naturally occuring protein 
substrate. Because these mutant proteases could display altered 
25 specificity as well as altered specific activity, the substrate 
should contain sufficient different cleavage sites i.e., acidic, 
basic, neutral, and hydrophobic, so as not to bias the assay toward 
a protease with one type of specificity. The substrate should also 
contain no derivitized residues that result in the masking of 
certain cleavage sites. The widely used substrates such as 
hemoglobin, azocollogen, azocasein, dimethyl casein, etc., were 
rejected on this basis- Bovine casein, a and <* 2 chains, was 
chosen as a suitable substrate. 

35 
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A 1 percent casein {w/v) solution was prepared in a 100 mM Iris 
buffer, pH 8.0, 10 mM EDTA. The assay protocol is as follows: 

790 yl 50 m Tns pH 8.2 
5 100 ul 1 percent casein {Sigma) solution 

10 pi test enzyme {10-200 ug). 

This assay mixture was mixed and allowed to fncubate at room 
temperature for 20 minutes. The reaction was terminated upon the 

10 addition of 100 *il 100 percent trichloroacetic acid, followed by 
incubation for 15 minutes at room temperature. The precipitated 
protein was pelleted by centr if ligation and the optical density of 
the supernatant was determined spectrophotometries! ly at 280 mi* 
The optical density is a reflection of the amount of unpreci pita ted, 

15 i.e., hydrolyzed, casein in the reaction mixture. The amount of 
casein hydrolysed by each mutant protease was compared to a series 
of standards containing various amounts of the wild type protease, 
and the activity is expressed as a percentage of the corresponding 
wild type activity. Enzyme activities were converted to specific 

20 activity by dividing the casein hydrolysis activity by the 280 nm 
absorbance of the enzyme solution used in the assay. 

All of the mutants which were assayed showed less specific 
activity on casein than the wild type with the exception of Asn+166 
25 which was 26 percent more active on casein than the wild type. The 
mutant showing the least specific activity was ile+166 at 0.184 of 
the wild type activity, 



30 
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CLAIMS 

1. A method of preparing a procaryotic carbonyl hydrolase 
which comprises; culturing a recombinant host cell 

5 transformed with an expression vector comprising the iMh 
sequence encoding the hydrolase, and recovering the 
hydrolase from the cell culture. 

2. The method of claim 1 wherein the hydrolase is a 

10 protease, preferably a subtilisin or a metalloprotease , most 
preferably a subtilisin, the DNA sequence preferably 
encoding subtilisin in the form of prosubtilisin or 
preprosubtil is in . 

15 3. The method of claim 1 wherein the recombinant host 
cell is a strain of Bacillus , preferably a Bacil lus 
subtil is. 

4. The method of claim 2 wherein the DMA sequence 
20 encoding subtilisin is operably linked to its native 
promoter, to a promoter homologous to the host which is 
other than the native promoter, or to a promoter which is 
heterologous to the host. 

25 5. The process of claim 2 wherein the recombinant host 
cell was transformed with an effective expression vector for 
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the DSA sequence encoding the procaryotic protease operably 
linked to its signal sequence. 

6. A composition comprising a procaryotic carbonyl 

5 hydrolase, preferably a Bacillus hydrolase, and a host 

microorganism transformed so as to be capable of expressing 
the hydrolase. 

7. A composition comprising prepro-, pre- or procarbonyl 
10 hydrolase, preferably prosubtilisin, essentially free of 

cells which express said prepro-, pre- or procarbonyl 
hydrolase. 

8. A liquid detergent composition comprising B. 
15 amy lol i quef ac ien s subtilisin. 

9. An expression vector for a procaryotic carbonyl. 
hydrolase which comprises a DMA sequence encoding the 
hydrolase operably linked to a promoter compatible with a 

20 suitable host cell. 

10. A recombinant expression vector comprising a DMA 
sequence encoding a prepro- or procarbonyl hydrolase, 
preferably subtilisin, operably linked to a promoter 

25 compatible with a suitable host cell? or a cell, preferably 
a strain of Bacillus , transformed by a said vector. 
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11. A method comprising: 

£a) isolating a DNA moiety encoding a procaryotic 
carbonyl hydrolase; 

(b) introducing a mutation into a predetermined region 
in the DNA which, upon expression of the DNA, results in the 
substitution, deletion or insertion of at least one amino 
acid at a predetermined site in the hydrolase; 

and optionally 

(c) transforming a suitable host with the mutated DNA 
of step (b) and recovering the expression product of the 
mutated DNA. 

12. The method of claim 11 wherein the mutation is 
predetermined; preferably expressed as the substitution or 
insertion of a single amino acid. 

13. The method of claim 11 wherein the DNA was isolated as 
a fragment of genomic DNA from an organism expressing the 
carbonyl hydrolase; the fragment preferably consisting 
essentially of the structural gene for the hydrolase. 

14. The method of claim 11 wherein the hydrolase is a 
protein hydrolase or lipase, preferably Bacillus subtilisin, 
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prosubtilisin, preprosubtilisin, a metal lopro tease or other 
peptide hydrolase. 

15. The method of claim 14 wherein the' hydrolase is 

5 Bacillus subtilisin and the mutation is introduced into the 
sufotilisin at aspartate+32, asparagine+155 , tyrosine+104 , 
methionine+222, glycine+166, histidine+64, serine+221, 
glycine+169, glutamate+156, serine+33, phenylalanlne+189, 
tyrosine+217 and/or alanine+152. 

10 

16. The method of claim 14 wherein the hydrolase is 
prosubtilisin or preprosubtilisin and the mutation is 
introduced into the presubtilisin or preprosubtilisin at 
tyrosine- 1 . 

15 

17. The method of claim 11 wherein the mutation is 
expressed as a mutant carbonyl hydrolase exhibiting a change 
in one or more of the oxidation stability, Km, Kcat, Kcat/Km 
ratio, substrate specificity, specific activity or pH 

20 optimum of the hydrolase. 

18. The method of claim 14 wherein the hydrolase is a 
peptide hydrolase selected from a-aminoacy Ipept ide 
hydrolase, peptidylami no-acid hydrolase, acy lamina 

25 hydrolase, serine carboxypeptidase, metallocarboxypeptidase, 
thiol proteinase, carboxylproteinase or metalloproteinase. 
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19. DNA encoding a predetermined mutant of a procaryotie 
carbonyl hydrolase; or a vector capable of transforming a 
host cell to produce a mutant bacterial carbonyl hydrolase, 
which vector comprises such DUA and which, upon 

5 transformation of the host cell, results in expression of 
the mutant hydrolase; or a host cell transformed with a said 
vector. 

20. A mutant carbonyl hydrolase of the kind obtained by 
10 the method of claim 11. 



21. A composition comprising the hydrolase of claim 20 in 
combination with a detergent., preferably a liquid detergent 
or a detergent in granular form; optionally additionally 

15 comprising a builder, bleach or fluorescent whitening agent. 

22. The composition of claim 11 wherein the detergent is a 
linear alkyl benzene sulfonate, alkyl ethoxylated sulfate, 
sulfated linear alcohol or ethyoxylated linear alcohol. 

20 

23. The method of claim 17 wherein a mutant exhibiting a 
different substrata specificity is recovered. 

24. The method of claim 11 wherein the mutation is within 
25 the enzyme active site. 



-58- 



0130756 



25. The method of claim 11 wherein the DNA is expressed in 
a bacterial host cell, preferably a Bacillus species. 

26. The method of claim 11 wherein the mutation is 

5 expressed as the substitution of one or more methionine, 
tryptophan, cysteine or lysine residues by a substituent 
amino acid residue not one of methionine, tryptophan, 
cysteine or lysine, preferably alanine or serine. 

10 27. The method of claim 17 wherein the mutation renders 
the mutant enzyme either less oxidation stable or more 
oxidation stable than the precursor enzyme. 

28. DMA encoding a predetermined enzymatically active 

15 mutant of a precursor procaryotic carbonyl hydrolase, said 
mutant exhibiting a different substrata specificity, 
oxidation stability and/or pH-activity profile than the 
precursor enzyme; or a vector capable of transforming a host 
ceil to produce a mutant enzyme, which vector comprises such 

20 mh and which, upon transformation of the host cell, results 
in expression of the mutant enzyme; or a host cell 
transformed with such a vector. 

29. A method comprising culturing a host cell of claim 28 
25 and recovering therefrom the mutant enzyme . 
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30. A substantially normally speculating Bacillus which is 
incapable of excreting subtilisin or neutral protease. 

31 * The Bacillus of claim 30, preferably free of Bacillus 
5 strains capable of excreting subtilisin or neutral protease, 
and comprising a mutant neutral protease gene, preferably 
nonrevertible, which gene contains a deletion that results- 
in no expression or, upon expression, an ensymatically 
inactive polypeptide. 

10 

32, The Bacillus of claim 30 which is (a) incapable of 
excreting neutral protease and (b) transformed with at least 
one DNA moiety encoding a polypeptide not otherwise 
expressed by the Bacillus , preferably a subtilisin mutant or 

13 a eucaryotic protein. 

33. A vegetative-phase Bacillus culture which is 
essentially free of neutral protease, or which is 
essentially free of subtilisin. 

20 

34 * A Bacillus culture free of any gene capable of 
expressing ensymatically active neutral protease, or free of 
any gene capable of excreting enzymatically active 
subtilisin. 

25 

35. A vector comprising a deletion mutated neutral 
protease gene or a deletion mutated subtilisin gene. 
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36. A method comprising eulturing the Bacillus of claim 30 
until a protein not subtilisin or neutral protease, 
preferably amylase, has accumulated in the culture, and 
recovering the protein. 

5 

37. A method comprising: 

(a) obtaining a DMA moiety encoding at least a portion 
of said precursor protein? 

10 

(b) identifying a region within the moiety; 

(c) substituting nucleotides for those already 
existing within the region in order to create at least one 

15 restriction enzyme site unique to the moiety, whereby unique 
restriction sites 5' and 3' to the identified region are 
made available such that neither alters the amino acids 
coded for by the region as expressed; 

20 (d) synthesizing a plurality of oligonucleotides , the 

5* and 3' ends of which each contain sequences capable of 
annealing to the restriction enzyme sites introduced in step 
(c) and which, when ligated to the moiety, are expressed as 
substitutions, deletions and/or insertions of at least one 

25 amino acid in or into said precursor protein; 
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(e) digesting the moiety of step {c} with restriction 
enzymes capable of cleaving the unique sites; 

{£} ligating each of the oligonucleotides of step (d) 
5 into the digested moiety of step (e> whereby a plurality of 
mutant DSA. moieties are obtained? 

and optionally the further steps of 

10 <g> expressing each of said moieties as a mutant 

protein in a suitable host; 

(h) recovering the mutant proteins of step (g); and 

15 (i) screening the step (h) mutant proteins for the 

desirable characteristic. 

38. The method of claim 37 wherein the restriction enzyme 
sites are different. 

20 

39. The method of claim 37 wherein the oligonucleotides 
are less than about 50 bp. 
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A. 

a»RI C(a\ PvuW Bom HI 



P RBS^-PflEMW MAT TERM 



8. 



C§) (3) C4 



3 liSTOFACtAAAATAT TATreCATAjlTTATACMn&ATACACAi^tAATCTCTCTAi t1%T?6TTC1<WyAA WAAAAM8 <*fiSA81W iTAflAS& SIS 

..\m PRE jw 

*r« S!v (.w tv* »*1 Tro JH Ser ten Lift- «ie AH Ls« Ala let' He Ahe The »*t Mi M» 61 y Ser Thr Sir 
W Afi* SSSC AAA AM ST* TSf, ATC AST T?0 Ctf, TTT f.CT TTA ACS HA A TC TTT ACT ATS. CSS TTC W ASC « TCC 

-Ht PRO ~«n 

$«r AH S' 
J?< TCT 5CC 0 



Pi I R8S 



-at PRO ~m 

H «)• S)v tv« Ser Asr. GJv SH i.vs i.vs Tyr 1 ie Val t!ty »he Us Sin Thr Pst Ser Tht" «et 
CS CCA Sr.fi AAA TCA AAC CSC 6AA A AC AAA 1ST ATT STC «SS TTT AAA CAS ACA ATS A8C ACS AT6 



„fir> -An 
.".is Ata Ivs tvs tvs Hp Va) Me Ser SH Lys Civ Sir Lvs V«T 51a Ivs Sin Che tvs Tyr »»1 Asn Ala 
?A< ASC GCC (HTF AAA AAA AAA SAT SIC ATT TCT MA AAA (SfiC «AS AAA fits fAA AAS CAA TTC AAA TAT fiTA SAC m 

-?s -?n -in 

AT A s Sr A? a Thr !_?:: ft*n filo ivs A? a y*t lv^ RT« teti t.v* i.vs Asp Pro See Val AH Tyr »*) filti «1u AsO 
3»« OCT TCA GC? ACA TTA AAC SAA AAA GCT STA AAA SAA TTC AAA AAA SAC CCS A«C 6TC 6CT TAC «TT RRA AAA RAT 

. 5fT ^MAT 

His »»T AH nH Ata T»r Alt Stn Ser Val Am Tyr G)y V«! Ser Gin H« Iy» AH f : r<5 Ala Leo RlJ Ssr CS)n 

m sac s» sc* sat sec, tag pes -as tcc bts «cr tag arc ata tea caa att aas sr.;: cct sct cm cac tct caa 
m Si in 

Sly !«• Thr Siy Ser Asn V»1 tvs ¥»l AH Vat iH A«A Ser Gly IH Asf> Ser See «1« Pro Asa Ten i.v« ¥*1 
«•?< 86S TAC ACT SRA ICA AAT (<U AAA G'A SCA STT ATC RAt ASC fiST ATC SAT TCT TCT CAT CCT BAT TTA AAA fiTA 

SO ?re Asn Af> Asji 



Sri Pro ASK »" ASji 

AU SSy 51 y AT* Ssr «*t »*1 »re Ssr 6 hi Shr Asn fro ?(« Sirs A» Asn Asn Kpr lii 5 G!v Vtsr ifis V«l Ata 
5*> SCA W« G5A KCC AAC ATA KT! CCS TC! SAA AC* AAT CCT TTC CAA (SAC AAC AAC TCT CAC (Hi* ACT CAC STT fiCC 

"A> ft"! Ssr JTa an 

Civ T*r V<»' AT a AH U>e As:: Asn fer Us SJy Visf l*» Sly V,\! AH Cro Sw AH So* Uw Tvf A 
m U8C ACA STT ACS SCT CTT AAT AAC TCA ATC SfiT fiTA TTA (35C STT 6C8 CCA ASC GCA TCA CTT TAC ft 



Spr AH Oft 

H Lvs 
CCT CTA AAA 



Asn Ala W J» 
/3t («u Sir AH Ass 61 y Ser C'iy ST« Tyr Ser Trn Me Me Asrs >?ly Me Shi Tra AH 11? AH Asn Asn *t 

aoa an err ear ser <wc «st tcc r«c caa tac a«c tw atc att aac cra atc <m m man atc sca «ac aat ats 

tJCi J:« H" 

Asn W ]l« Asf. «cl Set U« filv 81 y *"r» SW filv Ser AT« Al* Lra Lvs AH AU V»T A*» tvs AH V*«1 AH 

m m, sti ATI - aac ats «;c ctc tmc sua cct tct m tct sct sct ha aaa scr, rca rtt m m set «rr csca 

15« Sep thr !fi!! 

S*f Slv i'al if* I Vj) !As: AH AH AH Sty Asn SH lily Thr Ssr Sty S«r ^pr Sftr Thr V»t Sly Tw Pro 81 r 
»A9 T(;C CSC A^ 1TA ISTC ATT SOI WA «CT. «!T AAC «M S8C ACT TCC SSC ASK TCA ARC ACA KTfi fiSC TAC CCT fifit 



1 m im j ic 

LyS Tyr Vra Vr "Hi Me *l* V*1 «Jy AH Vat Asp Ser Ser asr sla Arn AH Ser Phe Ser Ser 'HI Sly Pro 
9?« ACA TAC CCT TCT (TTC ATT SCA fiTA fiCC SCT STT SAC AGC ASC ASC CAA ASA fif.A TCT TiC TCA ABC STA SKA CCT 

Sin Uu A59 V»1 «st Ala fro SH Vai Ser JH GH Ser TKr !.e« Pro Cly Asr. I.vs Tvr C1y AH Tvr Asn EH 
«<» SAfi CTT SAT GTC ATS SCA SCT BSC CTA TCT ATC CAA ABC ACT, CTT CCT RCA AAC AAA TAC SSR SCR TAC AAC CRT 

m ?a» ?4fs 

Thr Ser ntt AH S*r Pre His Vst AH STy Aia *)« Ats Leu Me te« Ser tvs SHs 9ra Asn Tro Thr Asn Thr 
tm ACS TCA ATS 6CA TCT CCS CAC C»TT 6CC SB> (SCR (TCT SCT TTS ATT CTT TCT AAC CAC CCS AAC Tfifi ACS AAC ACT 

?SA SH yfin 
61n »*1 Arf! Sw Ser lea Gin Asn Thr Thr The (y» t,P<( Siy Asp Set- Ptic Tyr Tyr QTy Lys GTv Lew Me A$* 
) ]«« CAA «TC CSC AAC Af.T TTA SAA AAC ACC Art ACA AAA CTT tST SAT TCT TTC TAC TAT «j» AAA «»t CTC ATC AAC 

?W ?7S TtTOAA 

W SH AH AH AH Stn «C * "™ 

!??A RTA CAS ACS SCA SCT CAT- TAA Af S feT^aflAA^C*-'TSC C1TCCCCCC ACCC : ATTTTTTATT A TTTTTC TTCCTCCr.C-ATSTTCAATrrRCTrS 

13i.fi A!AATCSA(:r^ATSCv:TCCCTC1SAAAATTTTASC(»6AAACSSCSSS^ 



\m CTTCCCSATTTrrSC,TrA&CTrAfTSC"r,TAACC,^TCAOC«RCBTTnCCTSATAC 
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Fig. 2. 
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Fig! 



1 6ATATACC?A«ATA6AeftT*«A*7C^TCtCAAAAAAATKi8TCT*CT*i»lwrATTftTrcCATCTATTACAAT*AftTTCACA6ftftTA6TCTTTTA«STAAS 

-100 

f»et Arg Ser Lys Lys Leu Trj> lie Ser l.eu Leu F'fte Ala Leu Vhr Leu 
101 TCTACTCTGAATTTTTTTAAAASGAGASGSf AAAGA 6TG S6A AGC AAA AAA TTG TB6 ATC AGC TTG TTS Tit GCG TTA ACQ TTA 

-90 -80 „?a 

He Phe Thr Met Sli Pha Ser Ask Met Scr Ala GH AU C1y Lys Ser Set- Thr Stu Lys Lys Tvr He Val 
185 AFC TIT ACS ATG SCS TTC AGC ASC ATS TCT SCS CAfS SCT SCC G6A ASA ASC AST ACA GAA AAfi AAA TAC ATI STC 

-60 -50 
Sty Pfce Lys Gin Thr Met Ser A1a Met Ser Ser Ala Lys Lys Lys Asp Val lis Ser Glu Lys Gly 61 y Lys Val 
GftA rtT AAA CA« *C* ATS ACT aft. ATft ACT TfC fir*- hi; AAA Sic ft s T err itr rrt Ml »«* Mf ee. «»» 



~*0 -30 -23 

GU Lys Sin Phe Lys Tyr Val Asn Ala Ala Ala Ala Thr Lea Asp Cto Lys AH ¥»» Lys 61a Lea Lys lv% Asp 

335 CAA ASS CAA ITT A AS TAT GTT AAC 6CS SCC SCA SCA ACA TTS SAT GAA AAA GCT STA AAA GAA TTG AAA AAA SAT 

-10 ~1 3 16 

Pro Ser Val AH Tyr Va! Slu GU Asp His tie Ala Bis SH Tyr AH 61i; Set- Vsl Fro Tyr 6iv If e Ser SH 

AW CCG AGC 6TT SCA TAT 6TS GAA GAA GAT CAT ATT 6CA CAT GAS TAT 6CS CAA TCT GTT CCT TAT GGC ATT TCT CAA 

20 30 32 

lie lys Ala Pro Ala Leu His Ser Sir Gly Tyr Thr Gly Ser Asn ¥»} Lys Val Ala Val lie Asp Ser Gly lie 

<85 ATT AAA SCO CCG GCT CTT CAC TCT CAS 66C TAC ACA GGC TCT AAC STA AAA STA GCT GTT STC SAC ASC SSA ATT 

40 50 SO 

Asp Ser Ser His fro Asp Law Asn Val Arg Sly Sly AU Ser Phe Va1 Pre Ser Gin Thr Asn Pro Tyr Gin Asp 

560 SAC TCT TCT CAT CCT GAC TTA AAC C.TC ASA SGC CGA GCA AGC TTC GTS CCT TCT BAA ACA AAC CCA TAC CAG 6AC 



Sly Ser Ser His Gly Thr His Va 1 Ala Gly Tdr He Ala AH Lau Asn'Asr, Ser He Sly Val Lea Sly Vai Ssr 
635 8SC AST TCT CAC GST ACS CAT GTA GCC GGf ACG ATT GCC GCT CTT A AT AAC TCA ATC GST GTT C" 



64 TO 80 

Val 

CTG GGC GTT ASC 

90 100 HO 

Pro Ser AH Ser Leu Tyr Ala Val Lys Val Leu Asp 5sr Thr Glv Ser Gty Gin Tyr Ser Trp lie lie Asn Gly 

?10 CCA AGC SCA TCA 17 A TAT GCA GTA AAA GTS CTT SAT TCA ACA EGA ASC BSC CAA TAT AGC TGG ATT ATT AAC GGC 

120 i30 

tie Slu Trp AH l!« Ser Asn Asn Set Ssp Vi i ile Asn Wet Ser Leu Gly Sly Pro Thr Gly Ser Thr Ala Leu 

?SS ATT GAG TGG GCC ATT TCC AAC AAT ATC- GAT GTT ATC AAC ATS AGC CTT GSC GCA CCT ACT SGT TCT ACA GCG CTG 

140 ISO ISO 

Lys Tfir tfal Val Asp Lys AU Val Ser Ser Gly lie ¥3l Val Ala Ala AU Ata Gly Asn Gltt Sly Ser Ser Sly 

860 AAA ACA GTC GTT CAC AAA GCC GTT TCC AGC SGT ATC STC GTT GCT SCC GCA GCC SSA AAC GAA GST ICS TCC GGA 

1?0 180 

Ser Thr Ser Thr Val Gly Tyr Pro A!a Lys Tyr Pro Ser Thr He Ala Val G1y AT a Val Asn Ser Ser Asrt Sirs 

S35 ASC ACA AGC ACA GTC GGC TAC CCT SCA AAA TAT CCT TCT ACT ATT SCA STA GGT GCG STA SAC AGC ASC AAC CAA 

190 200 ?1G 

Arc Ala Ser Phe 5er Ser Ala Gly Ser Shi Lea Asp l*al Met Ala Pro Gly Val Ser lie Sin Ser Tfir Leu Pro 

1610 AGS GCT TCA TTC TCC AGC GCA GST TCT GAG CTT SAT GTG ATS GCT CCT S6C STG TCC ATC CAA ASC ACA CTT CCT 

220 m 330 

Gly Gly Thr Tyr Sly Ala Tyr Asn Gly Thr Ser Net AU Thr Pro His Val AU Sly Ala Aia Ala Lea lie Leu 

1085 SSA SSC ACT TAC GGC GCT TAT AAC SSA ACS TCC ATG SCG ACT CCT CAC Sit GCC EGA SCA SCA SCS TTA ATT CTT 

240 250 26C 

Ser Lys His Pro Thr Trp Thr Asn AU GU Val Arg Asp Arg Leu Slu Ser Thr AU Thr Tyr Leu Sly Asn Ser 

1160 TCT SAG CAC CCS ACT TG3 ACA AAC 6C6 CAA GTC CGT GAT CST TTA GAA ASC ACT BCA ACA TAT CTT GSA AAC TCT 

2?fJ 

Phe Tyr Tyr Gly Lys Sly Leu ile Asn (fat Sin Ala Ala Ala Gin OC 

1?.1S TTC TAC TAT 6GA AAA GSG TTA ATC AAC GTA CAA SCA GCT SCA CAA TAA TAGTAAAAAGAAGCASGTTCCTCCATACCTGCTTC 



1518 TTTTTATTTGTCAGCATCtTSATGTTCCGSCGEATTCTCTTCTTTW 

ma ACAAGCACCSGAGSATCAACCTSCTCAGCCCCG-TCACGSCCAAATCCTGAAAeGTTTTAACACTGSCTTCTCTGTTCTCTGTC 
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